McMorris, J. A., II
Mendell,  L. 

Mendelson,  J. 

Mermelstein,  P. 

Morse,  D.  L. 

Mussi,  R.  N.  F. 

Naves, F.

Nelson, D. E.

(Teaching  Assistant) 
Nolan,  J.  J.,  Jr. 

Nowak,  R.  T. 

Oder, R. R.

Olsen,  J.  H. 

Omura,  J.  K. 

(National  Science 
Foundation  Fellow) 

Paul,  A.  P. 

Pauwels,  H.  (Belgium- 
American  Educational 
Foundation  Fellow) 
Petrie, L. M., Jr.

(Teaching  Assistant) 
Pfaff,  D.  W. 

Pfeiffer,  R.  R. 

Phipps,  C.  R.,  Jr. 
Pilla,  M.  A. 

(Teaching  Assistant) 
Prabhu,  V.  K. 
Praddaude,  H.  C. 
Pruslin,  D.  H. 

Puri,  S. 

(Teaching  Assistant) 
Qualls,  C.  B. 

Radoski,  H.  R. 
Rahmani,  Mahin 
Redi,  O. 

Reid,  M.  H. 

Reisman,  S.  S. 

(National Science
Foundation Fellow)
Richters,  J.  S. 

(National Science
Foundation Fellow)
Rizzo, J.
Roberts,  L.  G. 

Rogoff,  G.  L. 

Rook,  C.  W. ,  Jr. 
Rosenbaum,  P.  S. 
Rosenshein,  J.  S. 
Rowe,  A.  W. 

Rummler,  W.  D. 

Sachs,  M.  B. 

(Sloan  Fellow) 
Sandberg,  A.  A. 
Savage,  J.  E. 
Schlossburg,  H. 
Schuler,  C.  J.,  Jr. 
Serafim,  P.  E. 
Shaffner,  R.  O. 

(Teaching  Assistant) 
Shane,  J.  R. 

(Kennecott Copper
Fellow)

Shimony,  U. 

Simon, R. W.

Simpson,  J.  I. 

Sklar,  J.  R. 

Smith,  W.  S. ,  Jr. 
Smith,  W.  W. 

Smythe,  D.  L. ,  Jr. 

(Teaching  Assistant) 
Snodderly,  D.  M. ,  Jr. 
Sobel,  I. 

Solbes,  A. 

Spangler,  P.  S. 


Staelin,  D.  H. 

Stanten,  S.  F.  (Hughes 
Aircraft  Fellow) 
Steele,  D.  W. 

(Sloan  Fellow) 
Steinbrecher,  D.  H. 
Stiglitz,  I.  G. 

Strong,  W.  J. 
Sutherland,  W.  R. 
(National  Science 
Foundation  Fellow) 
Taub,  A.  (National 
Institutes  of  Health 
Fellow) 

Taylor,  T.  E. 

(I.C.T.  Fellow) 
Tepley,  N. 

Theodoridis,  G. 

Thiele,  A.  A. 

Thomae,  I.  H. 

(National  Science 
Foundation  Fellow) 
Thomas,  L.  C. 
(National  Science 
Foundation  Fellow) 
Thornburg,  C.  O. ,  Jr. 
Tobey,  M.  C.,  Jr. 
(Teaching  Assistant) 


Aaron,  A.  M. 
Aponick,  A.  A. 
Bayles,  R.  U. 
Brincko,  A.  J. 
Byers,  F.  H. 
Colodny,  Susan  F. 
Deininger, C. C.
Diephuis,  R.  J. 
Ecklein,  D. 

Frasco,  L.  A. 
Gadzuk,  J.  W. 
Grabowski,  R.  E. 
Guinan,  J.  J. ,  Jr. 
Hamlin,  R.  M. ,  Jr. 


Addis,  J.  L. 
Arnold,  P.  G. 
Baecker,  R.  M. 
Baugh, W. H., III
Bender, M. H.


Graduate  Students  (continued) 

Tomlinson, W. J., III
Topaz,  J.  M. 

Tretiak,  O.  J. 

Tse,  F.  Y-F. 

Tulenko,  J.  S. 

(U.S.  AEC  Fellow) 
Ulrich,  P.  B. 

Vanderweil,  R.  G. 

Van  Horn,  E.  C.,  Jr. 

(Teaching  Assistant) 
Wade,  C.  G. 

Wagner,  C.  E. 

(Teaching  Assistant) 
Waletzko,  J.  A. 

Wang,  G.  Y-C. 

Ward,  R.  L. 

(IBM  Fellow) 
Wasserman,  G.  S. 

Weiss,  T.  F. 

Welch,  J.  R. 

(Teaching  Assistant) 
Whitman,  E.  C. 
Wickelgren, G. L.
Wiederhold,  M.  L. 

Wilde,  D.  U. 

(Teaching  Assistant) 
Wilde,  G.  R. 


Undergraduates 

Hirschfeld,  R.  A. 
Johnson,  I.  S.  C. 
Koons,  H.  C. 

Libman,  S.  M. 

Lindes,  P. 

Mann,  C.  A.,  Jr. 
Marchese, P. S., Jr.
Murray,  M.  J. 

Nelson,  G.  P. 

Ng,  L.  C. 

Norris,  R.  B. ,  Jr. 
O'Halloran,  W.  F. ,  Jr. 
Parchesky,  J. 

Perrolle, P. M.
Peterson,  B.  A. 


Senior  Thesis  Students 

Books, G. E.
Butterfield, B. O.
Carson,  J.  F. 

Chu,  Y.  H. 

Cohn,  T.  E. 


Wilensky,  S. 
Williamson,  R.  C. 
Willke,  H.  L.,  Jr. 
Wilson,  G.  L. 

(Teaching  Assistant) 
Wilson,  W.  J. 

(Teaching  Assistant) 
Winett,  J.  M. 
Wissmiller, J. C.
(National  Science 
Foundation  Fellow) 
Witting,  H.  L. 

(U.  S.  AEC  Fellow) 
Witting,  J.  M. 

(I.T.T.  Fellow) 
Witzke,  K.  G. 

Wolf,  R.  P. 

Woo,  J.  C. 

Wright,  B.  L. 

Wright,  J.  S. 

Yarnell,  C.  F. 

Yudkin,  H.  L. 

Zeiger,  H.  P. 

Zuber,  B.  L. 
Zuckerman, B. M.
Zwicky,  A.  M. ,  Jr. 
(National  Defense 
Education  Act  Fellow) 


Piner,  S.  D. 
Rome,  J.  A. 

Rush,  ft.  D. 

Scott,  T.  A. 
Siemens,  P.  J. 
Smith,  D.  F. 
Speer,  E.  R. ,  Jr. 
Stark,  M.  F. 

Wan,  A.  C.  M. 
Weidner,  M.  Y. 
Williams,  J.  A. 
Wolfberg,  M.  S. 
Yap,  B.  K. 
Zucker,  R.  S. 


Cooper, W. W., IV
DiGregorio,  R.  F. 
Dralle,  A.  V. 
Einolf,  C.  W. ,  Jr. 
Ellms,  S.  R. 
PERSONNEL


Senior  Thesis  Students  (continued) 


Emerson,  S.  T. 
Epstein,  M.  R. 
Evans,  S.  A. 
Feezor,  M.  D. 
Fevrier,  A. 

Fiory,  A.  T. 
Flicker,  J.  K. 
Graham,  J.  D. 
Hadden,  W.  J.,  Jr. 
Hassan,  A.  R. 
Hegyi,  D.  J. 

Hiller,  D.  E. 
Hoffman,  E.  J. 
Hufford,  J.  H. 
Jansen,  R.  H. 
Jordan,  H.  D. 
Joseph,  D.  K. 
Kinzer,  T.  J. ,  III 
Kosdon,  S.  J. 
Kotler,  S.  J. 
Landowne,  D. 


Langbein,  D. 

Lennon,  W.  J. 

Lynch,  J.  T. 
McClees,  H.  C.,  Jr. 
MacDonald,  J.  F.  P. 
Manheimer,  W.  M. 
Marble,  C.  W. 
Marmon,  W.  C. 
Meyer,  J.  S. 

Miller,  S.  W. 

Milner,  P. 

Mudama,  E.  L. 
Musslewhite,  J.  T. 
Okereke,  S.  A. 
Pearlman,  M.  R. 
Porter,  R.  P. 

Purdy,  R.  J. 
Queenan,  J.  F. 
Rabiner,  R. 

Reznek,  S.  R. 
Ricketts,  A.  W. ,  Jr. 


Sarraquigne, M. R.
Schlessinger,  L. 
Sehn,  G.  J. 
Shawaker,  D.  R. 
Shroff,  M.  N. 
Shulman,  L.  M. 
Solin,  S.  A. 

Spira,  P.  M. 

Stavn,  M.  J. 
Stokowski,  S.  R. 
Tomes,  C.  F. 
Tripp,  A.  P. ,  Jr. 
Valby,  L.  R. 
Wachtel,  J.  M. 
Wawzonek, J. J.
Weingrad,  J.  S. 
Weintraub,  A.  C. 
Wende,  C.  D. 
Winsor,  N.  K. 
Yansen,  D.  E. 
Zilles,  S.  N. 


Assistants  and  Technicians 


Aquinde,  P. 

Arnold,  Jane  B. 
Atlas,  H. 

Babcock,  E.  B. 
Barrett,  J.  W. 
Barrows,  F.  W. 
Beaton,  Catherine  M. 
Bella,  C.  J. 

Berg,  A.  E. 

Butler,  R.  E. ,  Jr. 
Carbone,  Marie  L. 
Chance,  Eleanor  K. 
Connolly,  J.  T. 

Cook,  J.  F. 

Coyle, J. E.
Cranmer,  R.  E. 
Crist,  F.  X. 

DiPietro,  P.  J. 


Donahue,  P.  D. 

Engler,  R.  R. 
Fitzgerald,  E.  W. ,  Jr. 
French,  Marjorie  A. 
Gay,  H.  D. 

Grande,  Esther  D. 
Greenwood,  E.  L. 
Gregor,  C.  A. 

Hallett,  J.  G. 

Karas,  P. 

Kelly,  M.  A. 

Kierstead,  J.  D. 

Lewis,  R.  R. 

Major,  Diane 
McKenzie,  J.  A. 
McLean,  J.  J. 

Misail,  M.  L. 

Molin,  A.  H. 


Neal,  R.  W. 

Nolund,  J.  F. 

North,  D.  K. 
Overslvizen,  T. 
Papa,  D.  C. 

Peck,  J.  S. 
Porcaro,  R.  J. 
Pyle,  Cynthia  M. 
Samson,  P.  R. 
Schwabe,  W.  J. 
Sears,  A.  R. 

Smith,  Gabriella  W. 
Sprague,  L.  E. 
Stroud,  Marion  B. 
Thompson,  D.  S. 
Tortolano,  A.  J. 
Yaffee,  M.  A. 

Yee,  F.  Q. 


Document  Room 

Hewitt,  J.  H. 
Hurvitz,  Rose  S. 


Drafting  Room 

Navedonsky, C. P., Foreman
Morss, Dorothy H.

Donahue,  J.  B. 


Porter,  Jean  M. 
Rollins,  I.  E. 
Keefe, J. B., Foreman
Barnet,  F.  J. 

Bletzer,  P.  W. 

Brennan,  J. 

Bunick,  F.  J. 

Cabral,  M. ,  Jr. 


Amidon,  Willa  T. 
Barron,  Gladys  G. 
Bertozzi,  Norma 
Blais,  Gisele  G. 
Boyajian,  Judith  A. 
Brown,  Eileen  A. 
Brunetto,  Deborah  A. 
Canniff,  Kathleen  A. 
Carbone,  Angelina 
Cavanaugh,  Mary  C. 
Chapman,  Carol  A. 
Daly,  Marguerite  A. 
Dordoni,  Joan  M. 
Epstein,  Elinor  F. 
Geller,  Elaine  J. 


Barnes,  R.  A. 
Fleming,  Patricia  L. 


Doiron,  E.  J.,  Foreman 
Audette,  A.  G. 

Haggerty,  R.  H. 

Joyce,  T.  F. 


Lorden,  G.  J. ,  Foreman 
D'Amico,  C.  R. 


Staff 

Rosebury,  F. 
Ryan,  L.  W. 


Machine  Shop 

Carter,  C.  E. 
Daniels, W. M.
Gibbons,  W.  D. 
Liljeholm,  F.  H. 
MacDonald,  J.  R. 
Marshall, J. J.
Muse,  W.  J. 


Secretaries

Gordon, Linda S.
Greulach,  Vicki  E. 
Hall,  Nancy  K. 

Healy,  Sylvia  K. 
Imbornone,  Elaine  C. 
Johnson, Barbara A.
Kaloyanides, Venetia
Keim, Norma R.
Lannoy,  Doris  E. 
Laurendeau,  Carole  A. 
Litchman,  Sandra  H. 
Loeb,  Charlotte  G. 
May,  Nancy  A. 
McEntee,  Doris  C. 
Milan,  Marilyn  A. 


Technical  Typists 
Levin,  L. 


Stock  Rooms 

Legier,  D.  O. 
Lucas,  W.  G. 
McDermott,  J.  F. 


Technicians' Shop

Fownes,  Marilyn  R. 
Howell,  W.  B. 

Tube  Laboratory 

Glass  Blowers 

DiGiacomo,  R.  M. 
Doucette,  W.  F. 
Reimann, W.
Ryan, J. F.
Sanromá, J. B.
Shmid,  E. 
Smart, D. A.
Tucker,  C.  L. 


Morneault,  Beverly  A. 
Morneault,  Diane  M. 
Omansky,  Betsey  G. 
Petone,  Rosina  C. 
Rose,  Martha  G. 
Scalleri,  Mary  B. 
Scanlon,  Dorothea  C. 
Smith,  Claire  F. 
Solomon,  Cynthia 
Staffiere,  Rose  Carol 
Thomson,  Susan  M. 
Toebes, Rita K.
Touchette,  Thelma  B. 
Townley,  Madeline  S. 
Vesey,  Patricia  A. 


Levine,  R.  I. 
Rabkin,  W.  I. 


Massey,  L.  N. 
Riley,  J.  F. 
Sharib,  G. 
Sincuk,  J. ,  Jr. 


Lander,  H.  J. 
MacDonald,  K.  B. 


Technicians 

Aucella,  Alice  A. 
Griffin,  J.  L. 
Leach,  G.  H. ,  Jr. 
MacDonald,  A.  A. 


PUBLICATIONS  AND  REPORTS 


MEETING  PAPERS  PRESENTED 


Rice  University,  Electrical  Engineering  Graduate  Seminar,  Houston,  Texas 
December  3,  1962 

P.  A.  Willis,  Computer  Utilization  in  Biomedical  Engineering  Problems  (invited) 

Symposium on Neural Modelling (sponsored by the Librascope Division of General
Precision, Inc., and the Air Force Office of Scientific Research), Ojai, California

December  4-6,  1962 

B.  G.  Farley,  Simulation  of  Neuron  Networks  by  Digital  Computers  (invited) 

L.  Stark,  Retinal  Operations  in  Accommodative  Tracking  —  Non-linear  Describing 
Function  Approach  (invited) 

Texas Medical Center, University of Texas, Houston, Texas

December  4-',  1962. 

P.  A.  Willis,  Communication  Characteristics  of  Nerve  (invited) 

Neurological Control Systems (invited)

Stanford Research Institute, Stanford, California

December 5, 1962

L. Stark, Automatic Adaptive Pattern Recognition in Electrocardiograms (invited)

Basic Sciences Group of Boston Section of AIEE, Boston, Massachusetts

December 18, 1962

W. T. Peake, Research on the Auditory Nervous System: The Role of Engineers
(invited)

American  Association  for  the  Advancement  of  Science,  Biometrics  Division, 
Philadelphia,  Pennsylvania 

December  26,  1962 

M.  Eden,  A  Critical  View  of  Computers  in  Medical  Diagnosis  (invited) 


Modern  Language  Association  Meeting,  Washington,  D.  C. 

December  27,  1962 

E.  S.  Klima,  Relatedness  between  Grammatical  Systems  (invited) 


American  Physical  Society  Winter  Meeting  in  the  West,  Stanford  University,  Stanford, 
California 

December  27-29,  1962 

W. M. Whitney and C. E. Chase, Ultrasonic Dispersion in Liquid Helium below
1°K

Annual  Meeting,  Linguistic  Society  of  America,  New  York 
December  28,  1962 

P.  M.  Postal,  The  Formal  Characteristics  of  Grammatical  Agreement 

Woods Hole Oceanographic Institute, Woods Hole, Massachusetts

January 7, 1963

L. Stark, On-line Digital Computation in Neurological Experiments (invited)

American Chemical Society Meeting, Cincinnati, Ohio

January 16, 1963

G. G. Hammes, Studies of Fast Reactions in Enzymatic Systems

New York Meeting, American Physical Society, New York

January 23-26, 1963

J. F. Cochran and M. Yaqub, Magneto-resistance Effects in Small Gallium Single
Crystals

D. H. Douglass, Jr., M. S. Dresselhaus, and R. L. Kyhl, Measurement of the
Magnetic Field Dependence of the Surface Impedance of Superconducting Tin at 9 kmc

W. D. Jackson, C. R. Phipps, and R. A. Reitman, A-C Properties of Superconducting
Wires and Ribbon

C. A. Shiffman, M. Garber, J. F. Cochran, E. Maxwell, and G. W. Pearsall, The
Specific Heats of Tin-Lead Alloys

W. W. Smith, Hyperfine Structure and Magnetic Moments of Hg$^{195}$ and Hg$^{195*}$ in
the $6\,^3P_1$ State by Level Crossing

S. H. Wemple, Electric Field Effects on EPR Spectra in KTaO3:Fe3+

Association for Symbolic Logic Meeting, University of California, Berkeley
January 26, 1963

T.  More,  Jr. ,  Calculi  of  Sequences  of  Propositional  Formulas 

On  the  Modification  of  Modus  Ponens 


Two Lectures, Université Catholique de Louvain, Belgium
February  4,  1963 

Y.  W.  Lee,  Statistical  Theory  of  Nonlinear  Systems  (invited) 


The  Seventy-Fifth  Anniversary  Symposium  on  Engineering  for  Major  Scientific  Programs, 
Georgia  Institute  of  Technology,  Atlanta,  Georgia 

February  5-6,  1963 

W.  A.  Rosenblith,  Computers  and  Brains:  Competition  and/or  Co-existence 
(invited) 


Third  Quantum  Electronics  Conference,  Paris 
February  11-15,  1963 

G. W. Stroke, Theory, Production, and Use of Optical Gratings for High-Resolution
Spectroscopy


Conference  on  Sensory  Evoked  Response  in  Man  (sponsored  by  The  New  York  Academy 
of  Sciences,  Division  of  Biological  and  Medical  Sciences,  and  Albert  Einstein  College 
of  Medicine),  New  York 

February 14-15, 1963

J.  S.  Barlow,  Evoked  Response  in  Relation  to  Visual  Perception  and  Reaction  Times 
in  Man 


Biophysical  Society  Seventh  Annual  Meeting,  New  York 
February  18-20,  1963 

B.  G.  Farley,  Similarities  between  the  Behavior  of  a  Neuron-like  Network  Model 
and  "Slow-Wave"  Phenomena  on  the  Brain  (invited) 

J. L. Hall II, Binaural Interaction in Single Units of Accessory Superior Olivary
Nucleus of Cats

L.  Stark,  Control  of  Pupillary  and  Eye  Movements  (invited) 

A. Troelstra, A Non-linear Model for the Calculation of the DC Component in the
Human Electroretinogram (ERG)

T.  F.  Weiss,  A  Model  for  Firing  Patterns  of  Auditory  Nerve  Fibers 


1963  International  Solid-State  Circuits  Conference,  University  of  Pennsylvania, 
Philadelphia,  Pennsylvania 

February 20-22, 1963

J.  Y.  Lettvin,  Considerations  Underlying  the  Study  of  Sensory  Elements  (invited) 


Seminar,  Department  of  Electrical  Engineering,  Johns  Hopkins  University,  Baltimore, 
Maryland 

February  25,  1963 

M.  H.  Goldstein,  Jr.,  The  Role  of  Hearing  in  Animal  Communication  (invited) 

Symposium  on  Metallurgy  of  Electron  Emitting  Materials,  Dallas,  Texas 
February 25, 1963

W.  B.  Nottingham,  Electron  Emitting  Materials 


JOURNAL  ARTICLES  ACCEPTED  FOR  PUBLICATION 


(Reprints, if available, may be obtained from the Document Room,
26-327, Research Laboratory of Electronics, Massachusetts Institute
of Technology, Cambridge 39, Massachusetts.)


M. A. Arbib, Monogenic Normal Systems Are Universal (J. Australian Math. Soc.)


D.  E.  Baldwin,  Close  Collisions  in  a  Plasma  (Phys.  Fluids) 


J.  S.  Barlow,  R.  L.  Rovit,  and  P.  Gloor,  Correlation  Analysis  of  EEG  Changes 

Induced  by  Unilateral  Intracarotid  Injection  of  Amobarbital  (EEG  Clin.  Neurophysiol.) 


J.  S.  Bross,  Problems  of  Equivalence  in  Some  German  and  English  Constructions 
(MT) 

S.  C.  Brown,  G.  Bekefi,  and  R.  E.  Whitney,  Far  Infrared  Interferometer  for  the 
Measurement  of  High  Electron  Densities  (J.  Opt.  Soc.  Am.) 

D.  H.  Douglass,  Jr.  and  M.  W.  P.  Strandberg,  Stages  in  the  Education  of  a  Physi¬ 
Introduction


This report, the sixty-ninth in a series of quarterly reports issued by the Research
Laboratory of Electronics, contains a review of the research activities of the Laboratory
for the three-month period ending February 28, 1963. Since this is a report on work in
A. ULTRASONIC ATTENUATION IN SUPERCONDUCTORS

The work described here was initiated several months ago in the very high frequency
range. Equipment was designed and constructed for the generation and reception of
ultrasonic pulses at 165 mc. The transmitter consists of the high-frequency and
blocking-oscillator portions of the circuit used by Chick, Anderson, and Truell.¹ The
transmitting and receiving transducers are identical helical resonators² exciting x-cut
quartz rods that are inserted in an rf electric field produced in a gap of nearly the
same geometry as that of the re-entrant cavity used in the microwave phonon
experiments.³ These helical resonators are shown in Fig. I-1a. The two quartz rods
(0.118 inch in diameter, 0.504 inch long), with a metal specimen (usually approximately
0.1 inch thick) sandwiched between their ends, are inserted through a small hole in the
end of each helical resonator into the electric-field gap. The resonators are then
clamped in this coaxial configuration, mounted on stainless-steel coaxial lines, and
immersed in a standard double-dewar helium cryostat. Signal power from the receiving
transducer is mixed with local-oscillator power in a strip-line hybrid coupler; the dual
output is then passed through a balanced detector to a 30-mc, 120-db i-f strip.

This apparatus has been used to study ultrasonic attenuation in specimens of indium
and mercury, both in the superconducting and normal states. The metals were obtained
from standard chemical reagent stock (In, 99.99 per cent pure; Hg, 99.999 per cent
pure). No attempt was made to obtain single crystals with known orientation. In the
case of indium, the quartz transducer rods were clamped in a vee-block and the indium
was allowed to solidify in a Teflon mold.⁴ The mercury specimen was contained in a
small, cylindrical aluminum capsule; the quartz transducer rods had been fitted into
aluminum bushings that were inserted into the ends of the capsule and cemented into
place. (See Fig. I-1b.)

1.  Experiment 


In a typical experiment the entire transducer assembly containing the metal specimen
is cooled to some fixed temperature well below the superconducting transition
temperature. The transmitter initiates, in the transmitting transducer, a pulse
of rf power of approximately 25 watts, peak, and 1 μsec in duration. A small
fraction of this power (≈10⁻³) is converted into a pulse of acoustic energy of
longitudinal polarization by the piezoelectric effect. This pulse passes along the
first quartz rod, through the specimen, and into the second quartz rod. At the
receiving transducer the same fraction of the acoustic power is reconverted into
an rf signal, and the rest is reflected. Figure I-2 shows the resulting oscilloscope
traces. The first pulse is rf leakage. The second is an acoustic pulse
that has passed through both quartz rods and the metal specimen. The third
pulse is actually a superposition of two acoustic pulses: one has made a double
reflection in the first rod; the other, a double reflection in the second rod.
The superposition occurs because the two quartz transducer rods are
equal in length (within ±0.001 in.).

Fig. I-1. (a) Helical resonators used for the transmitting and
receiving transducers.
(b) Mercury capsule and the indium mold. The quartz
transducer rods are shown protruding from each.

Fig. I-2. (a) Acoustic pulses transmitted through superconducting
mercury. $a_1$ is rf leakage; $a_2$, an acoustic pulse that
has passed through both quartz rods and the mercury
specimen; $a_3$, superposition of two acoustic pulses,
one having made a double reflection in the first rod,
and the other, a double reflection in the second rod.
(b) Acoustic pulses transmitted through normal mercury.
This is exactly the same situation as in (a), except
that a magnetic field $H > H_c(T)$ has been switched on.

An electromagnet, capable of providing a variable magnetic field normal to
the ultrasonic wave vector and the cylindrical axis of the metal specimen,
surrounds the portion of the cryostat containing the specimen. If
a magnetic field greater than the critical field of the superconductor is switched
on, the metal returns to the normal state and the attenuation increases abruptly.
Figure I-2b shows the same pulse-echo pattern as that shown in Fig. I-2a, except
that the magnetic field $H > H_c(T)$ has been turned on. This change in the
ultrasonic attenuation is due entirely to the effect of the conduction electrons in the
metal.

The relative attenuation was measured as a function of applied magnetic field
for both metals, and is plotted in Fig. I-3. Observe that it is a continuous
function, a result that is implicit in the aluminum data of David, Van der Laan,
and Poulis. Data on the total change in attenuation between the normal and
the superconducting states should probably be taken from the first acoustic pulse
only. The reason for this is that the acoustic energy that arrives at the receiving
transducer at the time indicated by the second acoustic pulse, since it is a
superposition of two acoustic pulses that may well have travelled over slightly
different paths, has an unknown phase factor determined by the details of the paths.

A specimen of mercury, approximately 0.25 in. thick, was prepared in order
to obtain reasonably reliable velocity data. The velocity was measured at room
temperature at 165 mc and found to be 1.47 ± 0.03 km/sec, which is in agreement
with the measurements of Ringo, Fitzgerald, and Hurdle.⁶ A specimen
of this length, however, is more vulnerable to thermal contraction effects. Thus
far, it has not been possible to maintain satisfactory acoustic contact between
such a long specimen and the quartz transducer rods as the temperature is
lowered.
Fig. I-3. (a) Relative attenuation of a 165-mc ultrasonic pulse in
indium as a function of magnetic field.
(b) Relative attenuation of 165-mc ultrasonic pulses in
mercury as a function of magnetic field.
2. Discussion


An important mechanism for ultrasonic attenuation in metals, at temperatures low
enough that the electronic mean-free path becomes of the same order of magnitude
as the acoustic wavelength (usually below 10°K), is electron-phonon scattering. Thus,
if we have acoustic power $P_0$ incident on a metal specimen of length $L$, the power $P$
leaving the specimen will be

$$P = P_0 \exp[-(\alpha_{\text{elec}} + \alpha_0)L]. \qquad (1)$$

Here, $\alpha_{\text{elec}}$ is that portion of the attenuation coefficient caused by scattering from the
conduction electrons; $\alpha_0$ is that portion of the attenuation coefficient caused by
scattering from thermal phonons, lattice defects, grain boundaries, and so forth. The
coefficient $\alpha_0$ is temperature-independent in the range of temperatures which we are
considering, and therefore will not be discussed further. Using the free-electron model,
Pippard has calculated expressions for the electronic contribution to the attenuation
coefficient for arbitrary mean-free path.⁷ This approach has been well justified by the
experimental results of Morse⁸ at frequencies between 9 mc and 56 mc. Pippard's
result for longitudinal polarization is

$$\alpha_n = \alpha' \, \frac{6}{\pi} \, \frac{1}{ql} \left[ \frac{1}{3} \, \frac{(ql)^2 \tan^{-1}(ql)}{ql - \tan^{-1}(ql)} - 1 \right], \qquad (2)$$

where

$\alpha_n$ = the electronic contribution to the attenuation coefficient for the metal in the
normal state
$q$ = the acoustic wave vector
$l$ = the electronic mean-free path.
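The behavior of Pippard's expression at short and long mean-free path can be checked numerically. The sketch below assumes the form $\alpha_n/\alpha' = (6/\pi ql)\,[\tfrac{1}{3}(ql)^2 \tan^{-1}(ql)/(ql - \tan^{-1}(ql)) - 1]$; the function name `pippard_ratio` is illustrative only and does not come from the report.

```python
import math


def pippard_ratio(ql: float) -> float:
    """Return alpha_n / alpha' for longitudinal waves, given the product q*l.

    Assumed form (free-electron model, longitudinal polarization):
        (6 / (pi * ql)) * [ (ql)^2 * atan(ql) / (3 * (ql - atan(ql))) - 1 ]
    """
    if ql <= 0.0:
        raise ValueError("ql must be positive")
    atan_ql = math.atan(ql)
    bracket = (ql ** 2) * atan_ql / (3.0 * (ql - atan_ql)) - 1.0
    return 6.0 / (math.pi * ql) * bracket


# The ratio rises with ql and tends to 1 when ql >> 1,
# which is the limit in which alpha_n reduces to alpha'.
for ql in (0.1, 1.0, 10.0, 1000.0):
    print(f"ql = {ql:7.1f}   alpha_n/alpha' = {pippard_ratio(ql):.4f}")
```

For $ql \ll 1$ the same expression reduces to approximately $8ql/5\pi$, so that the attenuation in that regime is proportional to the mean-free path.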

The coefficient $\alpha'$, which is the limiting form of $\alpha_n$ for long electronic mean-free
path, $ql \gg 1$, is given by

$$\alpha' = \frac{\pi N m v_0 \omega}{6 \rho_0 u_l^2}, \qquad (3)$$

where

$N$ = the number of free electrons per unit volume
$m$ = the electronic mass
$v_0$ = velocity of the free electrons at the Fermi surface
$\omega$ = the ultrasonic frequency
$\rho_0$ = the density of the metal specimen
$u_l$ = the longitudinal acoustic velocity in the metal specimen.

Reasonable estimates based upon the free-electron model, under the assumption of
one free electron per atom, yield the following values for $\alpha'$ at 165 mc:

Indium    $\alpha'$ = 4.2 cm⁻¹
Mercury   $\alpha'$ = 8.6 cm⁻¹.
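An estimate of this kind can be sketched as follows, taking the limiting coefficient as $\alpha' = \pi N m v_0 \omega / 6\rho_0 u_l^2$ in CGS units with $\omega = 2\pi \times$ 165 mc. The report does not list the input data that were used, so the densities, atomic weights, and the indium sound velocity below are assumed handbook-style values chosen for illustration; only the mercury velocity of 1.47 km/sec is quoted in the text (measured at room temperature). The function name is hypothetical.

```python
import math

HBAR = 1.054571817e-27       # erg sec
M_E = 9.1093837e-28          # electron mass, g
N_AVOGADRO = 6.02214076e23   # atoms per mole


def alpha_prime(density, atomic_weight, sound_speed, freq_hz, electrons_per_atom=1.0):
    """Limiting (ql >> 1) electronic attenuation coefficient in cm^-1 (CGS units).

    density in g/cm^3, sound_speed in cm/sec, freq_hz in cycles/sec.
    """
    n = electrons_per_atom * density / atomic_weight * N_AVOGADRO   # electrons/cm^3
    k_fermi = (3.0 * math.pi ** 2 * n) ** (1.0 / 3.0)               # free-electron k_F
    v_fermi = HBAR * k_fermi / M_E                                  # cm/sec
    omega = 2.0 * math.pi * freq_hz
    return math.pi * n * M_E * v_fermi * omega / (6.0 * density * sound_speed ** 2)


# ASSUMED inputs (not taken from the report, except the Hg velocity):
alpha_in = alpha_prime(density=7.31, atomic_weight=114.82, sound_speed=2.7e5, freq_hz=165e6)
alpha_hg = alpha_prime(density=13.6, atomic_weight=200.59, sound_speed=1.47e5, freq_hz=165e6)
print(f"indium  alpha' ~ {alpha_in:.1f} cm^-1")
print(f"mercury alpha' ~ {alpha_hg:.1f} cm^-1")
```

With these assumed inputs the sketch gives values within roughly ten per cent of the 4.2 cm⁻¹ and 8.6 cm⁻¹ quoted above; the result is sensitive to the sound velocity, which enters as the inverse square.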

The Bardeen-Cooper-Schrieffer (BCS) theory of superconductivity has been applied
to the calculation of the ultrasonic attenuation in metals in the superconducting state.⁹
Their result is

$$\frac{\alpha_s}{\alpha_n} = 2f[\epsilon(T)], \qquad (4)$$

where

$\alpha_s$ = the electronic contribution to the attenuation coefficient for the metal in the
superconducting state
$f[\epsilon]$ = the Fermi function of the superconducting energy gap
$\epsilon(T)$ = the temperature-dependent superconducting energy gap.

Let us represent the change, in decibels, of the ultrasonic attenuation
between the normal and the superconducting states at any given temperature by $D(T)$.
Then

$$\alpha_n = \frac{D(T)}{10 \log_{10}(e) \, L \tanh\dfrac{\epsilon(T)}{2kT}}, \qquad (5)$$

where

$L$ = the length of the specimen in cm
$k$ = Boltzmann's constant
$T$ = the temperature in °K.

By using the theoretical curve for $\epsilon(T)$ given in BCS, it is possible to determine $\alpha_n$
from our experiments on indium and mercury.


Specimen   Purity       D(T)         T      L       (alpha_n)exp
           (per cent)   (db)         (°K)   (cm)    (cm⁻¹)

In         99.99        3.7 (±0.5)   1.75   0.172   5.0
Hg         99.999       5 (±1.0)     1.9    0.252   4.6
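The conversion from a measured decibel change to an attenuation coefficient can be sketched as below. It assumes the relation $\alpha_n = D(T) / [10 \log_{10}(e)\, L \tanh(\epsilon(T)/2kT)]$, which follows from the exponential power law together with $\alpha_s/\alpha_n = 2f[\epsilon(T)]$. The function and parameter names are hypothetical. The default `gap_over_kt = inf` is the low-temperature limit in which the hyperbolic tangent equals 1; it is not the BCS $\epsilon(T)$ curve, which would have to be supplied separately.

```python
import math

DB_PER_UNIT_EXPONENT = 10.0 * math.log10(math.e)   # about 4.343 db


def alpha_n_from_db(d_db, length_cm, gap_over_kt=math.inf):
    """Normal-state electronic attenuation coefficient (cm^-1) from D(T) in db.

    gap_over_kt is epsilon(T)/kT; the default (infinity) sets tanh(...) = 1.
    """
    if gap_over_kt <= 0.0:
        raise ValueError("gap_over_kt must be positive")
    return d_db / (DB_PER_UNIT_EXPONENT * length_cm * math.tanh(gap_over_kt / 2.0))


# Tabulated measurements: D(T) in db and specimen length in cm.
print(f"In: {alpha_n_from_db(3.7, 0.172):.2f} cm^-1")
print(f"Hg: {alpha_n_from_db(5.0, 0.252):.2f} cm^-1")
```

In this limit the tabulated indium and mercury data give about 4.95 and 4.57 cm⁻¹, close to the listed 5.0 and 4.6 cm⁻¹. Any finite value of $\epsilon(T)/kT$ makes the hyperbolic tangent smaller than 1 and therefore raises the inferred $\alpha_n$.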


These values of $\alpha_n$, determined experimentally, are of the same order of magnitude as
the limiting values $\alpha'$ calculated, by using Eq. 3, for the metals on the basis of the
free-electron model. We conclude that at 165 mc the purities of the metals are sufficient
to provide electronic mean-free paths at least of the same order of magnitude as the
acoustic wavelength. That is, $ql > 1$.
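To put a number on this conclusion, the sketch below evaluates the acoustic wavelength and the mean-free path at which $ql = 1$. It uses the 1.47 km/sec longitudinal velocity reported for mercury at room temperature; applying that figure at helium temperatures is an assumption made only for illustration, and the function names are hypothetical.

```python
import math


def acoustic_wavelength_cm(sound_speed_cm_s, freq_hz):
    """Acoustic wavelength lambda = u / f, in cm."""
    return sound_speed_cm_s / freq_hz


def mean_free_path_for_ql(sound_speed_cm_s, freq_hz, ql=1.0):
    """Mean-free path l (cm) that gives the requested product q*l, with q = 2*pi*f/u."""
    q = 2.0 * math.pi * freq_hz / sound_speed_cm_s
    return ql / q


wavelength = acoustic_wavelength_cm(1.47e5, 165e6)
l_threshold = mean_free_path_for_ql(1.47e5, 165e6)
print(f"wavelength        = {wavelength:.3e} cm")
print(f"l at which ql = 1 = {l_threshold:.3e} cm")
```

Under this assumption the wavelength is a little under $10^{-3}$ cm, and $ql = 1$ corresponds to a mean-free path of the order of $10^{-4}$ cm.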
As we have already pointed out, the relative attenuation, plotted as a function of
magnetic field, does not yield a discontinuity at $H_c$, but rather begins a slow rise at $H_e$,
well below $H_c$. This might be due to the presence of metallic impurities. Superconducting
alloys are sometimes characterized by such a slow transition as a function of magnetic
field, but this generally occurs at fields greater than $H_c$ for the corresponding pure
elements. A more cogent explanation is that the metal enters the intermediate state at
$H_e$, and then proceeds to the normal state in a continuous manner, the transition reaching
completion at $H_c(T)$. This intermediate state is a property of macroscopic
superconductors, and depends upon the geometry of the specimen. The same behavior has been
observed in the resistivity of macroscopic superconductors as a function of magnetic
field.¹⁰ The temperature dependence of $H_c$ is given by

$$\frac{H_c(T)}{H_c(0)} = 1 - \left(\frac{T}{T_c}\right)^2. \qquad (6)$$


This formula is an approximation, but it is sufficiently accurate for our use. Here,
$H_c(0)$ is the critical magnetic field at absolute zero, and $T_c$ is the critical temperature
at zero magnetic field. Both are properties of the particular superconductor under
investigation. We now evaluate $H_c(T)$ for each specimen using its critical constants
and the temperature, which was determined from the vapor pressure of the liquid helium.


Specimen   H_c(0)    T_c    T      H_c(T)    H_e       n_exp
           (gauss)   (°K)   (°K)   (gauss)   (gauss)

In         283       3.41   1.75   209       120       .43
Hg         411       4.15   1.9    325       237       .27

$H_e$ was determined from the graphs of the experimental data (Fig. I-3).
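The critical fields in the table can be checked with a short calculation. The sketch assumes the usual parabolic approximation $H_c(T) = H_c(0)\,[1 - (T/T_c)^2]$ and uses the tabulated critical constants; the function name is illustrative.

```python
def critical_field(hc0_gauss, tc_kelvin, t_kelvin):
    """Parabolic approximation to the critical field H_c(T), in gauss."""
    if not 0.0 <= t_kelvin <= tc_kelvin:
        raise ValueError("temperature must lie between 0 and T_c")
    return hc0_gauss * (1.0 - (t_kelvin / tc_kelvin) ** 2)


hc_indium = critical_field(283.0, 3.41, 1.75)
hc_mercury = critical_field(411.0, 4.15, 1.9)
print(f"In: H_c(1.75 K) = {hc_indium:.1f} gauss")
print(f"Hg: H_c(1.9 K)  = {hc_mercury:.1f} gauss")
```

Both results agree with the tabulated 209 and 325 gauss to within about one gauss.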

The macroscopic theory of superconductivity predicts that the metal will enter the
intermediate state at a field $H_e$ that is such that

$$H_e = H_c(T)\,(1 - n). \qquad (7)$$

This formula holds for macroscopic bodies whose dimensions are large compared
with a characteristic length $\lambda \approx 10^{-5}$ cm. The product $4\pi n$ is the
demagnetizing coefficient of the specimen which takes into account the effect of its
magnetization on the external magnetic field while in the superconducting state.
Obviously, $n$ depends upon the shape of the specimen. We list a few theoretical
$n$ values for simple cases:
                                             n

Sphere                                       1/3
Long cylinder in transverse field            1/2
Long cylinder in parallel field              0
Large disc with its plane normal to field    1


The shapes of our specimens were not controlled accurately, but can be described
approximately. The indium was formed into a disc, 0.30 cm in diameter and 0.172 cm
thick. At one edge of the disc, however, there was a sprue almost as large as the disc.
The applied magnetic field was oriented parallel to the plane of the disc. The mercury
was contained in a cylinder, 0.635 cm in diameter and 0.635 cm long. The quartz rods
penetrated 0.192 cm into the specimen at each end. The applied magnetic field was
perpendicular to the axis of the cylinder. The experimental values for $n$ show, indeed,
that the shape of the specimen is important in determining the onset of the intermediate
state.
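The experimental values of $n$ can be recovered from the tabulated fields. The sketch below assumes the standard macroscopic relation $H_e = H_c(T)(1 - n)$ for the onset of the intermediate state, so that $n = 1 - H_e/H_c(T)$; the function name is hypothetical.

```python
def demagnetizing_factor(h_onset_gauss, hc_gauss):
    """Experimental n from the onset field H_e and the critical field H_c(T)."""
    if not 0.0 <= h_onset_gauss <= hc_gauss:
        raise ValueError("onset field must lie between 0 and H_c(T)")
    return 1.0 - h_onset_gauss / hc_gauss


n_indium = demagnetizing_factor(120.0, 209.0)    # tabulated H_e and H_c(T) for In
n_mercury = demagnetizing_factor(237.0, 325.0)   # tabulated H_e and H_c(T) for Hg
print(f"In: n = {n_indium:.2f}")
print(f"Hg: n = {n_mercury:.2f}")
```

Both values lie between the limits 0 and 1/2 listed above for a long cylinder in parallel and in transverse fields.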

The  author  wishes  to  express  his  appreciation  to  E.  C.  Ingraham  for  the  construction 
of  the  helical  resonators,  the  mercury  capsule,  and  portions  of  the  helium  cryostat. 

J.  M.  Andrews,  Jr. 
A. WAVES IN A BRILLOUIN-FOCUSED ELECTRON-BEAM WAVEGUIDE

The only known wave solutions for the Brillouin-focused¹ electron beam are the $\phi$-independent slow waves that were found from an approximate (quasi-static and weak space-charge) analysis.² We shall give here the exact solution for all waves.


Consider a nonrelativistic Brillouin-focused electron-beam waveguide (Fig. II-1). In the unperturbed state this beam is characterized by a uniform velocity $v_0$ along the applied magnetic field $B_0$, a constant angular velocity equal to the Larmor frequency $\omega_L = -eB_0/2m$, and the Brillouin-focusing condition

$$\omega_L^2 = \tfrac{1}{2}\,\omega_p^2, \tag{1}$$

where $\omega_p^2 = e\rho_0/m\epsilon_0$ defines the plasma frequency.

The small-signal equations of this system are most conveniently written in a coordinate system that is translating with velocity $v_0$ and rotating with angular velocity $\omega_L$. Quantities relating to this coordinate system will be designated by double primes. In this frame Maxwell's equations take the simple form


$$\nabla \times \mathbf{E}'' = -j\omega''\mu_0\,\mathbf{H}'' \tag{2}$$


*This work was supported in part by Purchase Order DDL B-00368 with Lincoln Laboratory, a center for research operated by Massachusetts Institute of Technology with the joint support of the U.S. Army, Navy, and Air Force under Air Force Contract AF19(604)-7400.
$$\nabla \times \mathbf{H}'' = j\omega''\epsilon_0\left(1 - \frac{\omega_p^2}{\omega''^2}\right)\mathbf{E}'', \tag{3}$$


which corresponds exactly to the equations that govern an isotropic plasma waveguide. The complete solutions for waves in an isotropic plasma waveguide are known.³ Hence the complete solutions for the waves in a Brillouin-focused electron-beam waveguide can readily be obtained by a proper transformation of the isotropic plasma waveguide solutions.

1. Dispersion Characteristics

The dispersion characteristic for the isotropic plasma waveguide is shown in Fig. II-2. Here we shall consider only those branches that contain slow waves ($\beta > \omega/c$). For $\omega_p < \omega_{co}$ (the cutoff frequency of the empty waveguide) the branches that contain only fast waves occur for $\omega > \omega_{co}$, and are not shown in Fig. II-2. The dispersion characteristics shown are for the field solutions with dependence $\exp j(\omega'' t'' - m''\phi'' - \beta'' z'')$.

The dispersion characteristics in a coordinate system (denoted by single primes) that is rotating at a constant angular velocity $\omega_L$ are readily obtained through the nonrelativistic transformations

$$\omega'' = \omega' - m'\omega_L \tag{4}$$

$$m'' = m' \tag{5}$$

$$\beta'' = \beta'. \tag{6}$$

The results are shown in Fig. II-3. For most electron-beam systems of interest, $r\omega_L \ll c$ and the nonrelativistic transformations of Eqs. 4 and 5 apply. The first-order corrections to these transformations can be obtained from general relativity,⁴ and are

$$\omega'' = (\omega' - m'\omega_L)\,\gamma_L \tag{7}$$

$$m'' \simeq m', \tag{8}$$

where

$$\gamma_L = \left(1 - \frac{r^2\omega_L^2}{c^2}\right)^{-1/2}. \tag{9}$$


The dispersion characteristics in the laboratory frame, which is translating at the constant velocity $v_0$ with respect to the primed frame, are obtained through the nonrelativistic transformation

$$\omega' = \omega - \beta v_0 \tag{10}$$
Fig. II-2. Dispersion characteristics in the rotating and translating frame (plasma frame).

Fig. II-3. Dispersion characteristics in the frame rotating with respect to the plasma frame (translating with respect to the laboratory frame). (a) m = 0, (b) m = ±1, (c) m = ±2, (d) m = ±3.

Fig. II-4. Dispersion characteristics in the laboratory frame. (a) m = 0 (no backward waves), (b) m = ±1, (c) m = ±2, (d) m = ±3.

Fig. II-5. Dispersion characteristics in the laboratory frame for m = 0 waves containing a backward-wave region (Eq. 15).
$$\beta' = \beta. \tag{11}$$

The results are shown in Fig. II-4. The nonrelativistic transformation of Eqs. 10 and 11 is valid for $v_0 \ll c$ and for slow waves, $\beta > \omega/c$. The relativistically correct transformations are readily obtained from special relativity,⁴ and are

$$\omega' = (\omega - \beta v_0)\,\gamma_0 \tag{12}$$

$$\beta' = \left(\beta - \frac{\omega v_0}{c^2}\right)\gamma_0 \tag{13}$$

$$\gamma_0 = \left(1 - \frac{v_0^2}{c^2}\right)^{-1/2}. \tag{14}$$
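The chain of frame changes described above (laboratory frame, then translating frame, then rotating plasma frame) can be sketched in a few lines. The first function composes the nonrelativistic transformations of Eqs. 10-11 and 4-6; the second uses the standard Lorentz transformation of frequency and wave number for the translation. Function and argument names are illustrative assumptions, not notation from the report.

```python
import math

C = 299_792_458.0  # velocity of light in free space, m/s


def lab_to_plasma_frame(omega, beta, m, v0, omega_l):
    """Nonrelativistic map (omega, beta, m) -> (omega'', beta'', m'')."""
    omega_1 = omega - beta * v0       # translate with the beam (Eq. 10)
    omega_2 = omega_1 - m * omega_l   # rotate at the Larmor frequency (Eq. 4)
    return omega_2, beta, m           # beta and m are unchanged (Eqs. 5, 6, 11)


def lorentz_factor(v):
    """gamma = (1 - v^2/c^2)^(-1/2)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


def lab_to_translating_frame_rel(omega, beta, v0):
    """Standard special-relativistic transformation of omega and beta."""
    g = lorentz_factor(v0)
    return (omega - beta * v0) * g, (beta - omega * v0 / C ** 2) * g


if __name__ == "__main__":
    # Arbitrary illustrative numbers, not data from the report.
    print(lab_to_plasma_frame(10.0, 2.0, 1, 3.0, 1.5))
    print(lab_to_translating_frame_rel(1.0e10, 100.0, 1.0e7))
```

For $v_0 \ll c$ the relativistic result reduces to the nonrelativistic one, which is the regime assumed for Fig. II-4.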


In Figs. II-4b-d we note that backward-wave characteristics (positive group velocity and negative phase velocity) are obtained for all waves with $m \neq 0$. For the waves with circular symmetry, $m = 0$, Fig. II-3a and Eq. 10 or 12 show that for certain beam velocities, $v_0$, backward-wave characteristics must also be possible. The condition for this possibility is

$$v_0 < c\left[1 + \frac{I_0(k_p b)}{k_p b\,I_1(k_p b)\,\ln(a/b)}\right]^{-1/2}, \tag{15}$$

where $I_0$ and $I_1$ are the modified Bessel functions, $k_p = \omega_p/c$, and $c$ is the velocity of light in free space. Unlike the case for a confined-flow beam, $B_0 = \infty$, Eq. 15 is easily satisfied for most beams that can be produced in practice. Under the conditions of Eq. 15, the dispersion characteristic of Fig. II-4a changes to that shown in Fig. II-5.


2.  Energy-Power  Characteristics 

The time-averaged, small-signal energy per unit length of waveguide, $W_a''$, in the isotropic plasma waveguide described by Eqs. 2 and 3, is positive definite.³ Hence all of the waves in the double-prime frame are passive waves. The power flow in this frame, $P_z''$, is entirely electromagnetic.

In the coordinate system rotating at the angular frequency $\omega_L$ with respect to the double-prime frame, the nonrelativistic transformations of energy and power⁵ give

$$\frac{W_a'}{\omega'} = \frac{W_a''}{\omega''} \tag{16}$$

$$\frac{P_z'}{\omega' v_g'} = \frac{P_z''}{\omega'' v_g''}, \tag{17}$$
where $v_g$ is the group velocity (energy velocity) for the wave. The transformations in Eqs. 4 and 16 show how the active (negative energy) waves appear in the prime frame, Fig. II-3. The group velocity of the waves remains unchanged.

The transformations of energy and power to the laboratory frame⁵ are


u 


(18) 


(19) 


The appearance of active waves can now be traced to the transformations in Eqs. 10 and 18. We note the existence of active waves with phase velocities that may be either slower or faster than the velocity of light and $v_0$.

A.  Bers,  R.  S.  Smith 
III. MOLECULAR BEAMS

R. S. Badessa        W. D. Johnston, Jr.
V. J. Bates          S. G. Kukolich
B. Brabson           F. J. O'Brien
R. Golub             C. O. Thornburg, Jr.
G. L. Guttrich

A. CESIUM BEAM TUBE INVESTIGATION

During the past quarter a new control system was investigated which has the desirable property of minimizing the effects of the transient behavior of the cesium beam tube. As we mentioned in Quarterly Progress Report No. 67 (page 45), the transient occurring in the beam tube when the X-band excitation frequency is modulated could make circuit phase tolerances so severe as to preclude obtaining a stability of one part in $10^{12}$. The technique used in the new system consists essentially of utilizing square-wave frequency modulation of the excitation at a sufficiently low repetition rate (11 cps) to provide intervals that are comparatively free of transients, and then gating these transient-free intervals for use in the control loop.

Fig. III-1. Waveforms present in the cesium clock system.

The various waveforms generated in the system are shown in Fig. III-1. Figure III-1a is the 11-cps square wave from which all subsequent waveforms are derived. Its duty cycle of 50 per cent is guaranteed by driving a 2:1 flip-flop from a 22-cps pulse. The square wave, appropriately delayed, also serves as the reference for the synchronous detector of the loop. Figure III-1b is a triangular waveform obtained by integration of the square wave of Fig. III-1a. It is used for phase modulation of the 16.4-mc crystal oscillator that, by frequency multiplication to X-band, serves ultimately as the excitation signal. Triangular phase modulation conveniently yields the desired square-wave frequency-modulation waveform. Figure III-1c is the waveform observed at the output of the beam tube electron multiplier, when the average excitation frequency is located on one side of the cesium resonance peak. The exponential rise and decay is the transient, and represents a time constant approximately equal to the time of travel of the cesium atoms down the beam tube. It will be noted that the low repetition rate of 11 cps permits a relatively flat region to exist just before the next frequency excursion. It is this flat region that is gated for use in the loop, as is shown in Fig. III-1d. Figure III-1e shows how the fundamental component of the signal of Fig. III-1d disappears to form a 22-cps series of random noise bursts when the excitation frequency coincides exactly with that of cesium resonance. Figure III-1f is generated by delaying the signal of Fig. III-1a, and represents the reference signal for the synchronous detector. Figure III-2 is a block diagram of the complete system.
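The statement that triangular phase modulation yields square-wave frequency modulation follows because instantaneous frequency deviation is the time derivative of phase divided by $2\pi$. The sketch below checks this numerically for an 11-cps triangle; the peak phase of 1 radian and the function names are assumptions chosen only for illustration.

```python
import math


def triangle_phase(t, rate=11.0, peak=1.0):
    """Triangular phase (radians) with repetition rate `rate` and amplitude +/-peak."""
    x = (t * rate) % 1.0              # position within one modulation period
    if x < 0.5:
        return peak * (4.0 * x - 1.0)  # rising half: -peak to +peak
    return peak * (3.0 - 4.0 * x)      # falling half: +peak to -peak


def freq_deviation_hz(t, dt=1e-6, rate=11.0, peak=1.0):
    """Frequency deviation = (1/2pi) d(phase)/dt, by central difference."""
    d_phase = triangle_phase(t + dt, rate, peak) - triangle_phase(t - dt, rate, peak)
    return d_phase / (2.0 * dt) / (2.0 * math.pi)


if __name__ == "__main__":
    for t in (0.01, 0.03, 0.06, 0.08):
        print(f"t = {t:.2f} s  deviation = {freq_deviation_hz(t):+.4f} cps")
```

Away from the corners of the triangle the deviation takes only two values of equal magnitude and opposite sign, which is the square-wave frequency modulation described in the text.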

Figure III-3 shows the method used for checking the quadrature level. The synchronous detectors are mechanical choppers to prevent spurious direct current from being produced by reference unbalance. Because the references are 90° out of phase, the occurrence of zero direct current simultaneously at A and B can be achieved only by the complete absence of an 11-cps component in the incoming error signal. To obtain a calibration for the measurement, the loop is opened and the excitation frequency is detuned to one side of the resonance peak, to give maximum dc voltage at A. After measuring this voltage, the excitation frequency is recentered to the peak of the resonance, and the loop closed. Under the closed-loop condition the voltage at A will be very close to zero. The voltage at B will be a measure of quadrature and can be expressed as a fraction of the voltage that was observed at A under the open-loop condition. When measurements were made on the system of Fig. III-2, with a gate width $\Delta$ equal to one-third of a modulation half-period (see Fig. III-1d) and with the gate positioned just ahead of the next frequency excursion of the modulation, the quadrature level was well below that of the noise passing through the 30-sec time-constant filter. As the position of the gate was advanced toward the transient region, the quadrature appeared, as would be expected. The worst quadrature occurred when the gating was completely removed so that the signal to the synchronous detectors had the form shown in Fig. III-1c. Under this condition the quadrature was approximately 1.6 per cent of the in-phase signal level, or approximately 78 times the error voltage that would appear at the in-phase synchronous detector if the crystal oscillator were to detune one part in $10^{12}$.

Fig. III-2. Cesium clock system utilizing time-domain filtering for minimizing transient effects.

Fig. III-3. Double synchronous detectors for measurement of quadrature.
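The figures quoted in this paragraph can be turned into a short worked calculation. The sketch below derives the modulation half-period and gate width from the 11-cps rate, and the in-phase error level implied by the statement that 1.6 per cent quadrature is about 78 times the error for a detuning of one part in $10^{12}$. The function names are illustrative assumptions.

```python
def gate_timing(mod_rate_cps=11.0, gate_fraction=1.0 / 3.0):
    """Return (half_period, gate_width) in seconds for square-wave FM."""
    half_period = 1.0 / (2.0 * mod_rate_cps)
    return half_period, gate_fraction * half_period


def inphase_error_fraction(quadrature_fraction=0.016, ratio=78.0):
    """Error signal for one part in 1e12, as a fraction of the open-loop
    in-phase level, implied by the quoted quadrature figures."""
    return quadrature_fraction / ratio


if __name__ == "__main__":
    half_period, gate_width = gate_timing()
    print(f"half-period = {half_period * 1e3:.2f} msec")
    print(f"gate width  = {gate_width * 1e3:.2f} msec")
    print(f"error level = {inphase_error_fraction() * 100:.4f} per cent")
```

The gate is therefore about 15 msec wide within each 45-msec half-period, and a detuning of one part in $10^{12}$ corresponds to roughly 0.02 per cent of the open-loop in-phase signal.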

The stability of an atomic clock is determined partly by the stability of the resonance characteristic of its beam tube and partly by the stability of the electronics of the servo loop used to lock to it. These two effects are difficult to separate when seeking stabilities of one part in $10^{12}$. Certain performance requirements must be met by the loop, however, if such stability is to be achieved. For example, suppose that we were to remove the modulation from the excitation signal of the beam tube and open the loop (see Fig. III-2). Since the signal entering the synchronous detector should now be completely uncorrelated with the synchronous detector reference, no direct current should appear at its output. The random variations that do appear should not exceed the dc level that would occur when, in normal operation of the loop, the excitation frequency is detuned by one part in $10^{12}$. The measurement is made with a filter having a time constant equal to the measurement time required for one part in $10^{12}$ on the basis of beam tube signal-to-noise ratio. Figure III-4 is a block diagram of the noise susceptibility measurement system. The results are shown in Fig. III-5.

Figure III-6 is a block diagram of a test performed to determine the interchangeability of two independently adjusted electronic systems. Atomic clock No. 1 was operated in the normal manner, but the 11-cps error signal appearing at the output of its electron multiplier was fed to the electronic apparatus of the other clock. Because of the servo action of the first clock's loop, very little 11-cps signal would be expected at the output of the electron multiplier. However, if the loop is not operating properly, for example, because of quadrature effects or overloading on noise peaks, an excessive amount of 11-cps signal could exist. The presence of this component would cause a dc output to appear at the synchronous detector of the second clock in Fig. III-6. On the other hand, if the two electronic systems are in agreement and both are free of spurious effects, an input signal that gives a zero average out of one should give a zero average out of the other. Figure III-7 shows the results of such a test. The two systems were adjusted independently and had in common only the 22-cps oscillator frequencies of Fig. III-2, necessitated by the fact that the synchronous detector of clock No. 2 was required to demodulate the modulation introduced by clock No. 1.

Fig. III-4. Measurement of susceptibility to noise.

Fig. III-5. Noise susceptibility measurements.

Fig. III-6. Measurement of interchangeability of two independently adjusted systems.

Fig. III-7. Electronics interchangeability measurements.

Measurements of frequency stability are now in progress. We hope to present the results of the tests in our next report.

R.  S.  Badessa,  V.  J.  Bates,  C.  L.  Searle 


QPR  No.  69 




IV. RADIO ASTRONOMY*

Prof. A. H. Barrett      A. T. Anderson III      G. A. Garosi
Prof. J. W. Graham       J. C. Blinn III         J. W. Kuiper
Prof. R. P. Rafuse       R. K. Breon             B. R. Kusse
Dr. G. Fiocco            P. J. Brendel           D. H. Staelin
R. J. Allen                                      D. H. Steinbrecher

A.  K-BAND  RADIOMETRY  AND  OBSERVATIONS 

Observations at 25.5 kmc were made of Venus, Taurus A, the Sun, and the Moon during the first two weeks of December 1962. These were made with the Lincoln Laboratory 28-ft antenna shown in Fig. IV-1, and the superheterodyne radiometer described in Quarterly Progress Report No. 68 (pages 35-37). Observations were made simultaneously at 35.2 kmc at Lincoln Laboratory, M.I.T.

Fig. IV-1. 28-ft diameter parabolic antenna at Lincoln Laboratory, M.I.T.

The  data  have  not  been  completely  processed,  so  only  preliminary  results  are 
available. 

The observations of Taurus A, the Sun, and the Moon were all drift scans in which the antenna was pointed ahead of the source and held stationary while the source moved past. Venus was observed by using drift scans and on-off measurements. An on-off measurement consisted of tracking with the antenna pointed first to one side of the source, then on the source, and then off again. Each of these three steps lasted one or two minutes.

Venus was observed with approximately 130 drift scans, Taurus A with 20, the Sun with 8, and the Moon with 9. Venus was also observed with approximately 60 on-off measurements. Brief attempts were made to observe Cassiopeia A and Jupiter, but without success.

Figure IV-2 shows a typical drift scan of the Sun. The variation in the center is believed to be due to small gain fluctuations, and the wings of the curve are due to the antenna side lobes. Figure IV-3 shows a typical drift scan of the Moon. The asymmetry in the curve is due to the nonuniform temperature of the Moon's surface which exists 3.5 days before full Moon.

Fig. IV-2. Solar drift curve. (Horizontal axis: time.)

Fig. IV-3. Lunar drift curve.

The  preliminary  results  are: 

1. Venus (on-off data), $T_B = 400 \pm 100\,^{\circ}$K

2. Taurus A (assuming disk of 3.6' diameter of uniform intensity), $S_\nu = 230\,(\pm 100) \times 10^{-26}$ w/m² cps

3. Sun (center), $T_B = 8900 \pm 980\,^{\circ}$K

4. Moon (center)
   3.5 days before full Moon, $T_B = 240 \pm 40\,^{\circ}$K
   0.3 day before full Moon, $T_B = 254 \pm 30\,^{\circ}$K
   1.8 days after full Moon, $T_B = 254 \pm 30\,^{\circ}$K

In this list the error brackets are estimates.

These  preliminary  results  are  close  to  the  expected  values.  The  final  results  will 
be  presented  after  the  rest  of  the  data  are  analyzed  and  final  corrections  for  antenna 
gain  and  atmospheric  attenuation  are  completed. 

D.  H.  Staelin 


B.  RADIOMETER  AT  FOUR  MILLIMETER  WAVELENGTH 

In  Quarterly  Progress  Report  No.  66  (page  49)  a  Millimeter  Radio  Telescope  was 
described.  This  report  will  discuss  the  development  of  a  radiometer  for  use  with  this 
antenna. 

The block diagram of the radiometer is shown in Fig. IV-4. It is a Dicke type of radiometer utilizing a ferrite switch to compare the input signal with a known signal, a single-ended mixer, and two traveling-wave tubes for i-f amplifiers. The input is centered at 72 Gc (4.16 mm), the local oscillator at 69.0 Gc, and the i-f strip has a measured bandpass of approximately 1200 mc. The receiver noise figure was measured to be 31 db, 10 db greater than the design objective. The source of difficulty lies in the crystal mixer. An effort is now being made in the laboratory to obtain an improved mixer for use in millimeter-wave radiometers with microwave i-f strips.

Fig. IV-4. Four millimeter wavelength radiometer. (Block diagram: ferrite switch, mixer, 2-4 Gc traveling-wave tubes, crystal detector, 1000-cycle amplifier.)

Fig. IV-5. Schematic diagram of millimeter wave section of the radiometer.

The radiometer was installed in the 10-ft antenna, and an on-axis gain measurement made. The remote transmitter used in testing the 10-ft parabolic antenna was located at Quigley Memorial Hospital at Soldier's Home in Chelsea, a distance of 4.25 miles from the Compton Laboratories Building in Cambridge. The transmitter package consisted of a 6-inch parabolic antenna, klystron, and microwave equipment mounted on an optical telescope. The millimeter-wave section of the radiometer is shown schematically in Fig. IV-5, and pictorially in Fig. IV-6. This section of the radiometer was mounted at the focus of the 10-ft antenna, while the TWT i-f amplifiers and second detector were mounted in the counterbalance structure at the rear of the antenna.

A standard gain horn is used to provide the comparison signal. When the precision attenuator is tuned to balance, the antenna gain is given by

$$G(\mathrm{db}) = G_{cal}(\mathrm{db}) + L_{attn}(\mathrm{db}) + L_{path}(\mathrm{db}),$$

where

$G_{cal}$ = gain of standard gain horn

$L_{attn}$ = calibrated attenuation

$L_{path}$ = difference in path loss.

A comparison technique such as this reduces the effects of variations in transmitter power and atmospheric attenuation.

Weather conditions have not permitted a careful optimization of the feed-horn position. Consequently, the following data indicate only a lower bound on the performance of the antenna. Rough measurements show that the antenna has an on-axis gain of at least 62 ± 2 db, an efficiency of 31 per cent, H-plane (azimuth) side lobes of 12 db and 15 db, and E-plane (elevation) side lobes of 20 db.
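As a rough consistency check on the quoted gain and efficiency, the sketch below compares the measured 62 db with the gain of an ideal uniformly illuminated circular aperture, $(\pi D/\lambda)^2$, for a 10-ft dish at 72 Gc. This is the usual textbook definition of aperture efficiency and is assumed here; the report does not state how its 31 per cent figure was computed. Function names are illustrative.

```python
import math

C = 299_792_458.0   # velocity of light, m/s
FT_TO_M = 0.3048


def antenna_gain_db(g_cal_db, l_attn_db, l_path_db):
    """Comparison-method gain: G = G_cal + L_attn + L_path (all in db)."""
    return g_cal_db + l_attn_db + l_path_db


def ideal_gain_db(diameter_m, freq_hz):
    """Gain of a uniformly illuminated circular aperture, (pi D / lambda)^2."""
    wavelength = C / freq_hz
    return 10.0 * math.log10((math.pi * diameter_m / wavelength) ** 2)


def aperture_efficiency(measured_gain_db, diameter_m, freq_hz):
    """Ratio of measured gain to ideal aperture gain."""
    return 10.0 ** ((measured_gain_db - ideal_gain_db(diameter_m, freq_hz)) / 10.0)


if __name__ == "__main__":
    d = 10.0 * FT_TO_M
    print(f"ideal gain = {ideal_gain_db(d, 72e9):.1f} db")
    print(f"efficiency = {aperture_efficiency(62.0, d, 72e9) * 100:.0f} per cent")
```

Under these assumptions the ideal gain is a little over 67 db, so a measured 62 db corresponds to an efficiency of roughly 30 per cent, in line with the value quoted above.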

J.  C.  Blinn  III 


C.  OPTICAL  RADAR  TO  STUDY  THE  EARTH'S  ATMOSPHERE 

The  use  of  an  optical  radar  to  study  the  Earth's  atmosphere  by  back-scattering  has 
been  considered  and  the  construction  of  an  instrument  is  well  advanced. 

In particular, we plan to study the dust layers¹ up to heights of 100 km. At first our main object will be some systematic observations of the noctilucent clouds (~80 km); these clouds are more easily distinguishable from molecular Rayleigh scattering, which is predominant at lower heights. The mechanism of Raman scattering will also be utilized to infer density profiles for the various atmospheric molecular species because the value of the wavelength shift is a characteristic of each species.

The optical radar consists of a short-pulse ruby laser capable of the following performance:

energy in one pulse, 0.5-1.0 joule

length of pulse, 1 µsec

pulse repetition frequency, 1 cps.

The laser beam is collimated by means of a refractor of 2.019-m focal length and 0.075-m diameter. The receiver utilizes a Newtonian reflecting telescope of 2.70-m focal length and 0.30-m diameter, and a photoelectric photometer.
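The pulse parameters fix the basic ranging performance of such an instrument. The sketch below applies the usual pulsed-radar relations (range resolution $c\tau/2$, echo delay $2R/c$) to the 1-µsec pulse and to the heights mentioned in the text; these relations and the function names are standard assumptions and are not taken from the report.

```python
C = 299_792_458.0  # velocity of light, m/s


def range_resolution_m(pulse_length_s):
    """Smallest resolvable range interval for a pulse of given length."""
    return C * pulse_length_s / 2.0


def echo_delay_s(height_m):
    """Round-trip time for a return from a scatterer at the given height."""
    return 2.0 * height_m / C


if __name__ == "__main__":
    print(f"range resolution for 1 usec pulse: {range_resolution_m(1e-6):.0f} m")
    for h_km in (80.0, 100.0):
        print(f"echo delay from {h_km:.0f} km: {echo_delay_s(h_km * 1e3) * 1e6:.0f} usec")
```

A 1-µsec pulse therefore resolves layers about 150 m thick, and returns from 100 km arrive well within the 1-sec interval between pulses.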

G.  Fiocco 
V. NOISE IN ELECTRON DEVICES*

Prof. H. A. Haus              Prof. R. P. Rafuse
Prof. P. L. Penfield, Jr.     W. D. Rummler


A.  PHOTON  STATISTICS  OF  OPTICAL  MASER  OUTPUT 

The measurement of the output-power spectra of narrow-band optical-maser oscillators or amplifiers is currently under way. Information on the power spectra may be obtained from the statistics of the output photon number. These, in turn, may be inferred from the count of photoelectrons from a photosurface exposed to the optical-maser beam. We shall first present the classical theory of such an experiment, and then summarize the preliminary experimental results.

We assume that the probability of producing a photoelectron current pulse within the time interval $dt$ is $\alpha P(t)\,dt$; $P(t)$ is the power in the incident light (in general, a statistical quantity), and $\alpha$ is a factor incorporating the photoefficiency of the phototube. The probability of obtaining exactly $K$ counts in an interval $T$ follows the Poisson law for nonstationary processes,

$$P(K) = \frac{n^K e^{-n}}{K!}, \tag{1}$$

$$n = \int_0^T \alpha P(t)\,dt. \tag{2}$$

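If $P(t)$ is a statistical variable, an average has to be taken with respect to its probability distribution. We shall indicate this last average by a bar:

$$\bar n_T = \sum_{K=1}^{\infty} \overline{P(K)}\,K = \int_0^T \alpha\,\overline{P(t)}\,dt = \alpha T \bar P \tag{3}$$

and

$$\overline{n_T^2} = \sum_{K=1}^{\infty} K^2\,\overline{P(K)} = \bar n_T + \overline{\left[\int_0^T \alpha P(t)\,dt\right]^2}. \tag{4}$$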


The last integral can be evaluated easily in two limits: for $T \ll \tau_0$ and for $T \gg \tau_0$, where $\tau_0$ is the correlation time of $P(t)$, which is of the order of the inverse bandwidth of the maser. (For optical-maser amplifiers with highly reflecting mirrors this time will in general not be shorter than 10 nsec. For an oscillator it will be much longer than that.) In the limit $T \ll \tau_0$,

$$\overline{\delta n_T^2} = \overline{n_T^2} - \bar n_T^2 = \alpha T \bar P + \alpha^2 T^2\,\overline{P^2(0)} - \alpha^2 T^2 \bar P^2. \tag{5}$$

For a Gaussian amplitude distribution we have

$$\overline{P^2} = 3\bar P^2 \tag{6}$$

and thus

$$\overline{\delta n_T^2} = \bar n_T\,(1 + 2\bar n_T). \tag{7}$$

This result applies for a linearly polarized light output. If the light is unpolarized, the mean-square fluctuations are additive, and we have

$$\overline{\delta n_T^2} = \bar n_T\,(1 + \bar n_T). \tag{8}$$

For a sinusoid, this is

$$\overline{\delta n_T^2} = \bar n_T. \tag{9}$$

Thus, for a photoelectron count $\bar n_T \gg 1$ within a time $T \ll \tau_0$, there is a great difference between the fluctuations observed for a Gaussian signal and those of a sinusoid (or near sinusoid).
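The size of the difference between the three short-time cases is easy to see numerically. The sketch below evaluates the mean-square count fluctuation of Eqs. 7, 8, and 9 for a given mean count; the function names are illustrative, and the sample mean count of 5.5 is the value quoted later in this report for the shortest counting interval.

```python
def var_gaussian_polarized(n_mean):
    """Eq. 7: Gaussian signal, linearly polarized light, T << tau_0."""
    return n_mean * (1.0 + 2.0 * n_mean)


def var_gaussian_unpolarized(n_mean):
    """Eq. 8: Gaussian signal, unpolarized light, T << tau_0."""
    return n_mean * (1.0 + n_mean)


def var_sinusoid(n_mean):
    """Eq. 9: sinusoidal signal; pure Poisson (shot-noise) fluctuation."""
    return n_mean


if __name__ == "__main__":
    n = 5.5
    print("Gaussian, polarized:  ", var_gaussian_polarized(n))
    print("Gaussian, unpolarized:", var_gaussian_unpolarized(n))
    print("sinusoid:             ", var_sinusoid(n))
```

For a mean count of 5.5 the Gaussian cases give mean-square fluctuations several times to an order of magnitude larger than the sinusoidal case, which is the distinction the counting experiment relies on.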


In the limit $T \gg \tau_0$, we may approximate

$$\overline{\left[\int_0^T \alpha P(t)\,dt\right]^2} \simeq \alpha^2 T \int \overline{P(t)\,P(t+\tau)}\,d\tau. \tag{10}$$

For a Gaussian process

$$\overline{P(t)\,P(t+\tau)} = \bar P^2 + 2R^2(\tau), \tag{11}$$

where $R^2(\tau)$ is the square of the amplitude autocorrelation function that is related to the spectrum, and thus to the line shape, by a Fourier transform. We obtain for linearly polarized light (compare Purcell³)

$$\overline{\delta n_T^2} = \bar n_T\left(1 + \bar n_T\,\frac{\tau_0}{T}\right). \tag{12}$$

Here, we have set

$$\tau_0 = \frac{2\displaystyle\int_{-\infty}^{+\infty} R^2(\tau)\,d\tau}{\bar P^2}. \tag{13}$$
The limiting value of $\overline{\delta n_T^2}/\bar n_T$ attained for large $T$, if the photon flux is held constant (so that $\bar n_T$ is proportional to $T$), gives the correlation time $\tau_0$.

We have made preliminary measurements on an optical maser oscillator of 6328 Å center wavelength and 1-mw output, using a phototube (EMI/9558) and a high-speed counter with a maximum counting rate of 10 counts/µsec. The light was attenuated before reaching the multiplier and the low spikes were rejected in the count. The shortest counting interval used was 100 µsec, with $\bar n_T = 5.5$. If the maser signal had been Gaussian, we should have seen the large $\overline{\delta n_T^2}$ corresponding to Eqs. 7 or 8. This was not observed. We found $\overline{\delta n_T^2} = \beta\,\bar n_T$ for counting intervals $T$ from $10^{-4}$ sec upward in decade steps, where $\beta$ was a number varying in magnitude between 0.5 and 12, the large deviation from unity occurring for relatively long observation times $T$ and large average photon counts. The deviations of $\beta$ from unity may be due in part to the statistics of the photomultiplication. We believe that the experiment has shown that the signal is not Gaussian, exhibiting power fluctuations much smaller than those corresponding to a Gaussian signal of a bandwidth that one may reasonably expect for a maser oscillator. The experiments are continuing. They are undertaken jointly with C. Freed and R. J. Carbone of Lincoln Laboratory, M.I.T., and J. McDonald, a senior in the Department of Electrical Engineering, M.I.T.

G.  Fiocco,  H.  A.  Haus 
A. INTERACTION BETWEEN THE RADIATION FIELD AND THE STATISTICAL STATE OF A PLASMA

The fact that the electromagnetic radiation generated by a plasma can alter the statistical distribution of the radiating particles is generally not taken into consideration. The effects are expected to be large when, for example, the plasma departs from thermodynamic equilibrium and becomes unstable. The intense radiation that the plasma can now generate¹ reacts on the energy distribution of the particles, and it is not inconceivable that by driving the distribution closer to a Maxwellian, the instability will in time be quenched.

The problem of coupling between the radiation field and the statistical state of the plasma has been discussed recently by Kudryavtsev,² Akhiezer et al.,³ and Dreicer.⁴ A self-consistent calculation requires a simultaneous solution of two nonlinear equations and thus far has not been carried out. In this report we make no attempt to solve the problem; we shall merely set up the appropriate equations and present them in a form different from the one given by the above-mentioned authors. Since much of the intense radiation occurs at radio and microwave frequencies and the emission is primarily caused by the free electrons, we shall restrict our considerations to this situation.

We have two equations that describe a system composed of a plasma and its radiation field. The one is the equation of transfer for the flow of radiant energy. If $I_\omega$ is the intensity of radiation in the radian frequency interval between $\omega$ and $\omega + d\omega$ along a ray $s$, then

$$\frac{1}{c}\frac{\partial I_\omega}{\partial t} + \frac{\partial I_\omega}{\partial s} = j_\omega - \alpha_\omega I_\omega, \tag{1}$$

where $j_\omega$ and $\alpha_\omega$ are the emission and absorption coefficients of the plasma, respectively. The foregoing equation can be rewritten¹ in terms of the rate of spontaneous emission of radiation, $\eta_\omega$, and the distribution of electron velocities, $f(v)$:


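$$\frac{1}{c}\frac{\partial I_\omega}{\partial t} + \frac{\partial I_\omega}{\partial s} = \int_0^\infty \eta_\omega(v)\,f(v)\,4\pi v^2\,dv + \left[\frac{8\pi^3 c^2}{\omega^2}\int_0^\infty \eta_\omega(v)\,\frac{\partial f(v)}{\partial u}\,4\pi v^2\,dv\right] I_\omega. \tag{2}$$

Here $u = mv^2/2$ is the electron energy, and $f(v)$ has been assumed to be isotropic (in principle there is no difficulty in extending Eq. 2 to anisotropic distributions).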

The second equation is Boltzmann's equation that prescribes the distribution of electron velocities and concentrations in the presence of fields and particle interactions:


8f 

8t 


+  v 


(3) 


E and B are the externally applied fields. Since, however, $I_\omega$ can be considered as the electromagnetic radiation composed of externally applied fields, as well as the internally generated radiation, E and B represent only that portion of the external forces not included in $I_\omega$.

The term $(\partial f/\partial t)_{\text{radiation}}$ (hereafter $C_{\text{rad}}$) is a way of writing symbolically the difference between the rates at which electrons are scattered into and out of a volume element of velocity space as a result of their emission and absorption of radiation; it is this term that is generally neglected in computations of f and which, as we shall see, makes Eq. 3 a function of $I_\omega$.

We shall now calculate $C_{\text{rad}}$. The quantity $C_{\text{rad}}\,d^3v$ equals the rate at which electrons enter the volume element $d^3v$ minus the rate at which they leave $d^3v$, as a result of spontaneous emission, absorption, and stimulated emission. We shall consider, first, the rates at which they enter and leave $d^3v$ as a result of radiative transitions to and from two neighboring elements $d^3v'$ and $d^3v''$, where v' < v and v'' > v. The speeds v, v', and v'' are such that transitions $v \rightleftarrows v'$, $v \rightleftarrows v''$ are associated with the emission or absorption of a photon $\hbar\omega$ whose frequency is given by

$$\hbar\omega = \tfrac{1}{2}mv^2 - \tfrac{1}{2}mv'^2 = \tfrac{1}{2}mv''^2 - \tfrac{1}{2}mv^2. \tag{4}$$


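Writing $\eta_{\omega A}$ as the rate of absorption and $\eta_{\omega S}$ as the rate of stimulated emission, we obtain

$$\begin{aligned} C_{\text{rad}}(v,v',v'')\, d^3v ={}& \left[\eta_{\omega A}(v') f(v')\, d^3v' - \eta_{\omega A}(v) f(v)\, d^3v\right]\frac{I_\omega}{\hbar\omega}\, d\omega\, d\Omega \\ &+ \left[\eta_{\omega S}(v'') f(v'')\, d^3v'' - \eta_{\omega S}(v) f(v)\, d^3v\right]\frac{I_\omega}{\hbar\omega}\, d\omega\, d\Omega \\ &+ \left[\eta_\omega(v'') f(v'')\, d^3v'' - \eta_\omega(v) f(v)\, d^3v\right]\frac{d\omega\, d\Omega}{\hbar\omega}, \end{aligned} \tag{5}$$

where $\eta_\omega$, $\eta_{\omega A}$, and $\eta_{\omega S}$ are related as in Einstein's A and B coefficients:

$$v'\,\eta_{\omega A}(v') = v\,\eta_{\omega S}(v), \qquad \eta_{\omega S}(v) = \frac{8\pi^3 c^2}{\hbar\omega^3}\,\eta_\omega(v). \tag{6}$$

We simplify Eq. 5 by proceeding to the low-frequency limit $\hbar\omega \to 0$. By means of Eqs. 6 we first replace $\eta_{\omega A}$ and $\eta_{\omega S}$ by the rate of spontaneous emission, $\eta_\omega$. We then expand f(v'), f(v''), $\eta_\omega(v')$, and $\eta_\omega(v'')$ in Taylor series about the velocity v, retaining three terms of the expansion wherever $I_\omega$ appears as a multiplier in Eq. 5, and two terms when $I_\omega$ does not enter. After somewhat lengthy algebraic manipulations and use of Eqs. 4 we find that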


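$$C_{\text{rad}}(v,v',v'') = \frac{1}{mv^2}\left[\frac{d\left[\eta_\omega(v)\, v f(v)\right]}{dv} + \frac{8\pi^3 c^2}{m\omega^2}\, I_\omega\, \frac{d}{dv}\left(\eta_\omega \frac{df}{dv}\right)\right] d\omega\, d\Omega. \tag{7}$$

The total value of $C_{\text{rad}}$ is obtained by integrating Eq. 7 over all frequencies $\omega$ of the low-frequency spectrum and over all angles $d\Omega$ in which radiation flows. Since $I_\omega$ and $\eta_\omega$ refer to one characteristic wave only, we must sum over both modes of polarization. Thus,

$$C_{\text{rad}} = \sum_{1,2}\int_\omega\int_\Omega \frac{1}{mv^2}\left[\frac{d(\eta_\omega v f)}{dv} + \frac{8\pi^3 c^2}{m\omega^2}\, I_\omega\, \frac{d}{dv}\left(\eta_\omega \frac{df}{dv}\right)\right] d\omega\, d\Omega. \tag{8}$$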


As a check on the algebraic manipulations, we multiply Eq. 8 by $4\pi v^2\,dv$ and integrate over all velocities. We find the correct result, $\int_0^\infty C_{\text{rad}}\, 4\pi v^2\, dv = 0$, which shows that electrons are conserved. Knowing the form of $C_{\text{rad}}$, we can in principle evaluate $I_\omega$ and f from Eqs. 2 and 3 for any radiation mechanism, $\eta_\omega$, of a plasma subjected to external fields E, B. Note that all of our results apply in the nonrelativistic limit. Extension of the results to plasmas with relativistic electrons is straightforward.

The two terms on the right-hand side of Eq. 8 refer to different physical processes. The first term describes the net rate at which particles leave a certain energy range as a result of energy loss by spontaneous emission. The second term represents the rate of "heating" of electrons by the ambient radiation field, $I_\omega$.


Special  Cases 

(a) Consider a collisionless plasma which is in a steady state in the presence of the radiation field $I_\omega$, which in part is due to the radiation from the plasma, and in part to any external sources that may be present. Then, from Eq. 3,

$$\frac{\partial f}{\partial t} = C_{\text{rad}} = 0. \tag{9}$$

Evaluating Eqs. 8 and 9, we obtain

$$f = \text{constant} \times \exp\left[-\int \frac{\sum\iint \eta_\omega\, d\omega\, d\Omega}{\sum\iint \dfrac{8\pi^3 c^2}{\omega^2}\, I_\omega \eta_\omega\, d\omega\, d\Omega}\; mv\, dv\right], \tag{10}$$

where the symbols $\sum\iint$ denote summation over two polarizations and integrations over $\omega$ and $\Omega$.
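As a consistency check on Eq. 10, the exponent can be evaluated numerically. The sketch below assumes (this is not taken from the report) that the quotient of the two polarization-summed integrals is supplied as a function `ratio(v)` in units of 1/energy, and it uses a simple trapezoidal rule. When the ambient radiation is black-body radiation in the Rayleigh-Jeans limit, $I_\omega = kT\omega^2/8\pi^3c^2$ per polarization, the quotient is 1/kT for any $\eta_\omega$, so the result should be a Maxwellian. The function names and the 10,000°K example are illustrative only.

```python
import math

M_E = 9.109e-28   # electron mass, g
K_B = 1.381e-16   # Boltzmann constant, erg/deg K


def steady_state_f(v, ratio, steps=2000):
    """Eq. 10 with the constant set to 1: exp(-integral_0^v ratio(v') m v' dv').

    ratio(v) is the assumed quotient
    sum-integral(eta) / sum-integral(8 pi^3 c^2 I eta / omega^2), in 1/erg.
    """
    dv = v / steps
    integral = 0.0
    for i in range(steps):
        a, b = i * dv, (i + 1) * dv
        # trapezoidal rule applied to the integrand ratio(v) * m * v
        integral += 0.5 * dv * (ratio(a) * M_E * a + ratio(b) * M_E * b)
    return math.exp(-integral)


def rayleigh_jeans_ratio(temp_k):
    """For black-body radiation at temp_k the quotient is 1/kT, whatever eta is."""
    return lambda v: 1.0 / (K_B * temp_k)


temp = 1.0e4                                  # radiation temperature, deg K (illustrative)
v_test = math.sqrt(2.0 * K_B * temp / M_E)    # speed at which m v^2 / 2 = kT
f_numeric = steady_state_f(v_test, rayleigh_jeans_ratio(temp))
f_maxwell = math.exp(-M_E * v_test**2 / (2.0 * K_B * temp))
print(f_numeric, f_maxwell)
```

Any other radiation field can be explored by passing a different `ratio` function; the distribution is then non-Maxwellian unless the quotient is independent of v.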

(b) Suppose that in addition to $C_{\text{rad}}$, the electrons make elastic collisions with atoms or ions. If $\nu(v)$ is the collision frequency,5 then

$$C_{\text{coll}} = \frac{G}{2v^2}\,\frac{d}{dv}\left[v^2\nu\left(vf + \frac{kT}{m}\frac{df}{dv}\right)\right], \tag{11}$$

where G = 2m/M with M as the mass of the atom or ion, and T as their temperature. In the steady state (with E = B = 0 as in case (a)),

$$\frac{\partial f}{\partial t} = C_{\text{rad}} + C_{\text{coll}} = 0. \tag{12}$$

Solving, we obtain

$$f = C \exp\left[-\int \frac{1 + \sum\iint \dfrac{\eta_\omega}{uG\nu}\, d\omega\, d\Omega}{kT + \sum\iint \dfrac{8\pi^3 c^2}{\omega^2}\, I_\omega\, \dfrac{\eta_\omega}{uG\nu}\, d\omega\, d\Omega}\; mv\, dv\right]. \tag{13}$$

B. ELECTRON TEMPERATURE DECAY IN THE AFTERGLOW OF A
PULSED HELIUM DISCHARGE

The Transient Microwave Radiation Pyrometer1 has been used to study the electron temperature decay in the helium afterglow.2

Fig. VI-1. Electron temperature decay for varying amounts of added argon impurity.

Figure VI-1 illustrates the dependence of the temperature decay on fractional additions of argon of zero per cent, 0.36 per cent, and 3.67 per cent at a helium pressure of 0.520 mm Hg. As we have noted,2 the asymptotic temperature decay rate at this pressure should be equal to the rate of destruction of metastable atoms. The increase in this asymptotic decay rate with increasing argon concentration bears out our hypothesis that metastable atoms are determining the asymptotic behavior of the temperature decay. From the observed asymptotic decay rates in Fig. VI-1, a cross section for the argon-metastable destructive collision can be calculated. A value of $2.6 \times 10^{-16}$ cm$^2$ is obtained, which compares reasonably with Biondi's3 measurement of the argon ionization cross section in an argon-metastable collision of $9 \times 10^{-17}$ cm$^2$.

Figure VI-2 shows the temperature decay for lower pressures at which, presumably, the metastables are lost by diffusion at such a rapid rate that their heating effect is not felt in the afterglow. The electron temperature decay in this case should now be determined by elastic recoil energy losses. The theoretical curve for such a decay may be readily calculated for helium, for which the collision probability is known fairly well to be $P_c = 20p$ cm$^{-1}$, where p is in mm Hg. The pressure multiplied by the time is the normalized time variable, so that the curves should superpose if only elastic recoil is operating to cool the electrons. The curve at p = 0.120 mm Hg displays some residual metastable heating, while the curve at p = 0.03 mm Hg displays some apparent additional cooling. Calculations have indicated that diffusion cooling should not contribute significantly in this region.
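The scaling argument above (curves superpose when plotted against pressure multiplied by time) can be illustrated with a deliberately simplified cooling model. The sketch assumes, as a stand-in for the actual theoretical curve, that $dT_e/dt = -(2m/M)\,\nu\,(T_e - T_g)$ with $\nu = P_c\,\bar{v}$ and $\bar{v}$ the mean thermal speed; the starting temperature, gas temperature, and the chosen value of p·t are hypothetical.

```python
import math

M_E = 9.109e-28      # electron mass, g
M_HE = 6.646e-24     # helium atom mass, g
K_B = 1.381e-16      # Boltzmann constant, erg/deg K
PC_PER_MMHG = 20.0   # collision probability, per cm per mm Hg (value quoted in the text)


def cooling_rate(temp_e, temp_gas, pressure_mmhg):
    """Assumed model: dT_e/dt = -(2m/M) * nu * (T_e - T_gas), nu = Pc * mean speed."""
    mean_speed = math.sqrt(8.0 * K_B * temp_e / (math.pi * M_E))   # cm/sec
    nu = PC_PER_MMHG * pressure_mmhg * mean_speed                   # collisions/sec
    return -(2.0 * M_E / M_HE) * nu * (temp_e - temp_gas)


def temperature_at(pt_product, pressure_mmhg, temp_start=3000.0, temp_gas=300.0, steps=2000):
    """Integrate with the midpoint rule up to the time t = pt_product / pressure."""
    dt = pt_product / pressure_mmhg / steps
    temp = temp_start
    for _ in range(steps):
        half = temp + 0.5 * dt * cooling_rate(temp, temp_gas, pressure_mmhg)
        temp += dt * cooling_rate(half, temp_gas, pressure_mmhg)
    return temp


pt = 1.0e-5                              # mm Hg * sec (hypothetical)
temp_high_p = temperature_at(pt, 0.120)  # the two pressures mentioned in the text
temp_low_p = temperature_at(pt, 0.030)
print(temp_high_p, temp_low_p)
```

Because the pressure enters the model only through the product of ν and t, the two temperatures agree; any measured departure from a common curve then points to an additional heating or cooling process, which is the comparison the paragraph above makes.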

J. C. Ingraham
C. MEASUREMENTS OF ELECTRON RELAXATION RATES IN PLASMAS

The  thermodynamic  state  of  a  plasma  can  be  inferred  from  its  radiation  spectrum. 

In this report we present measurements of the radiation spectrum in the afterglow of a plasma. By observing the approach of the spectrum toward one associated with the equilibrium state, it is possible to infer the relaxation rates to the equilibrium condition. We shall show that in our plasma two relaxation rates are dominant. At high pressures the relaxation toward a Maxwellian distribution is governed by short-range electron-atom collisions, while at low pressures the dominant relaxation mechanism comes from electron-electron impacts.

In previous reports1,2 we have shown that the microwave radiation spectrum from weakly ionized plasmas departs significantly from the Kirchhoff-Planck law. The departure has been shown to be consistent with theory, provided that one assumed a non-Maxwellian energy distribution for the plasma electrons. The magnitude of the departure also depends on the energy dependence of the electron-neutral collision frequency for momentum transfer in the energy range where most of the electrons are situated. By comparing the measured and calculated spectra, it has been possible to deduce the electron energy distribution function and the mean energy.

Recent measurements further substantiate the above-mentioned results. We measure the radiation temperature $T_r$ (as defined previously1,2) of a positive column in argon, immersed in a dc magnetic field, as a function of time in the afterglow of the plasma. A discharge of 1-msec width is pulsed at a frequency of 200 cps, and the radiation temperature is measured as a function of magnetic field ($B \propto \omega_b$, where $\omega_b$ is the electron cyclotron frequency), at a fixed time in the afterglow. The measurements were performed with the transient radiation pyrometer described in a previous report,3 and the measuring frequency was fixed at $6\pi \times 10^9$ rad/sec.

The pyrometer accepts radiation within a 1-μsec gate, which is movable in time from 10 μsec before the discharge pulse is shut off to 3 msec past this cutoff point. We take this point to be our reference in time (i.e., t = 0), so that when t is negative we refer to results obtained from a "going" discharge, and when t is positive we refer to the results obtained in the afterglow.

Some of the results are shown in Fig. VI-3. In all cases the radiation temperature is shown as a function of frequency or magnetic field for various times in the afterglow; $p_0$ is the pressure reduced to 0°C, and I is the discharge current. Figures VI-3a and VI-3b give the results at a low pressure for two discharge currents. Figures VI-3c and VI-3d show the results for the same currents but at a higher pressure. The departure of the radiation temperature from the Kirchhoff-Planck law (i.e., the peaks at cyclotron resonance, $\omega \approx \omega_b$) is seen to be decreasing as a function of time in the afterglow. This is consistent with the interpretation that these peaks are due to non-Maxwellian electron velocity distributions. As the discharge pulse is cut off, the electrons relax to a Maxwellian distribution and the peaks disappear. The approach to a Maxwellian distribution may be due to electron-electron collisions or electron-neutral collisions.

At low pressures it is the electron-electron collisions that dominate the relaxation process. Here we would expect the rate of relaxation to be strongly dependent on the electron density.4 This is borne out by the results shown in Fig. VI-3a and VI-3b, in which the peak in $T_r$ is seen to decrease much faster for a discharge current of 34 ma than for a current of 18 ma. Figures VI-3c and VI-3d show that the relaxation rate (as interpreted by the decrease of the peaks in $T_r$) at a higher pressure, for which electron-neutral collisions dominate the process, is relatively insensitive to discharge current.

The results can only be interpreted qualitatively, at this time, for two reasons. First, the radiation temperature does not relate linearly to the distribution function, and second, we did not measure the electron density in the afterglow. We are now conducting calculations that will enable us to determine the distribution function and the mean energy of the electrons as a function of time in the afterglow by the methods previously described.1,2 We have installed a probe in our discharge tube in order to measure the electron density in the afterglow. With these results we hope to determine a time constant for the approach of the electrons to a Maxwellian distribution.

H. Fields, G. Bekefi
D. VISCOUS DAMPING OF PLASMA WAVES

An attempt will be made to explain the experimental results on propagation and damping of ion acoustic waves reported by Wong, D'Angelo, and Motley.1 The momentum transport equations will be modified by the introduction of a viscous force. This procedure was suggested by a comparison of the wavelength and Coulomb mean-free path. For a frequency of $10^5$ cps and a phase velocity of $10^5$ cm/sec, the associated wavelength is 1 cm. For a particle density of $6 \times 10^{10}$/cm$^3$ and a temperature of 2300°K, the Debye length squared is approximately $1.8 \times 10^{-6}$ cm$^2$. Hence, taking the magnitude of the mean-free path as $L = (n\lambda_D^3)\lambda_D$, where n is the density, and $\lambda_D$ is the Debye length, we obtain L = 0.2 cm. Because of the similarity of the magnitudes of wavelength and mean-free path, we suspect that the waves will be strongly viscous-damped. For the magnitude of the viscosity coefficient we take $\eta \approx L V_T \rho$, where $V_T$ is the thermal velocity, and $\rho$ is the mass density. Hence the kinematic viscosity will be $\nu = \eta/\rho = L V_T$.
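The order-of-magnitude estimates in this paragraph are easy to reproduce. The sketch below uses Gaussian units and the standard expression $\lambda_D^2 = kT/4\pi n e^2$ (assumed here, since the report does not write it out); the variable names are illustrative.

```python
import math

K_B = 1.381e-16        # Boltzmann constant, erg/deg K
E_CHARGE = 4.803e-10   # electron charge, esu


def debye_length_squared(density_cm3, temp_k):
    """lambda_D^2 = kT / (4 pi n e^2), in cm^2 (Gaussian units)."""
    return K_B * temp_k / (4.0 * math.pi * density_cm3 * E_CHARGE**2)


density = 6.0e10       # particles per cm^3
temperature = 2300.0   # deg K

lambda_d_sq = debye_length_squared(density, temperature)
lambda_d = math.sqrt(lambda_d_sq)
mean_free_path = (density * lambda_d**3) * lambda_d   # L = (n lambda_D^3) lambda_D
wavelength = 1.0e5 / 1.0e5                            # phase velocity / frequency, cm

print(f"lambda_D^2 = {lambda_d_sq:.2e} cm^2")
print(f"L = {mean_free_path:.2f} cm, wavelength = {wavelength:.1f} cm")
```

The mean-free path comes out within an order of magnitude of the wavelength, which is the comparison that motivates adding a viscous term.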

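The equations that will be used to obtain the dispersion relation are the linearized force and continuity equations and Poisson's equation.

$$\frac{\partial \boldsymbol{\Gamma}_\pm}{\partial t} = -\mu_{s\pm}^2 \nabla n_\pm \pm \frac{n_0 e}{m_\pm}\mathbf{E} + \nu_\pm \nabla^2 \boldsymbol{\Gamma}_\pm \tag{1}$$

$$\frac{\partial n_\pm}{\partial t} + \nabla\cdot\boldsymbol{\Gamma}_\pm = 0 \tag{2}$$

$$\nabla\cdot\mathbf{E} = 4\pi e(n_+ - n_-). \tag{3}$$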

In these equations the particle flux $\boldsymbol{\Gamma}_\pm$ is defined as $n_0 V_\pm$, where $V_\pm$ is the ion or electron velocity, and $n_0$ is the unperturbed particle density; $n_\pm$ is the perturbed particle density; and $\mu_{s\pm}$ is the ion or electron sound velocity. For equal electron and ion temperatures we have $\mu_{s-}^2 = (m_+/m_-)\mu_{s+}^2$ and $\nu_-^2 = (m_+/m_-)\nu_+^2$. By using Poisson's equation and the continuity equations, the force equations may be written in terms of the perturbed densities.

$$\frac{\partial^2 n_\pm}{\partial t^2} = \mu_{s\pm}^2 \nabla^2 n_\pm \mp \omega_{p\pm}^2 (n_+ - n_-) + \nu_\pm \nabla^2 \frac{\partial n_\pm}{\partial t}, \tag{4}$$

where $\omega_{p\pm}$ is the plasma frequency, and

$$\omega_{p\pm}^2 = \frac{4\pi n_0 e^2}{m_\pm}.$$


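By allowing $n_\pm$ to vary as $e^{j(\mathbf{k}\cdot\mathbf{r} - \omega t)}$ in Eq. 4, the following equations are obtained:

$$\begin{aligned} \left[\omega^2 - \mu_{s+}^2 k^2 - \omega_{p+}^2 + j\nu_+\omega k^2\right] n_+ + \omega_{p+}^2\, n_- &= 0 \\ \omega_{p-}^2\, n_+ + \left[\omega^2 - \mu_{s-}^2 k^2 - \omega_{p-}^2 + j\nu_-\omega k^2\right] n_- &= 0. \end{aligned} \tag{5}$$

Setting the determinant of Eq. 5 equal to zero, the following dispersion relation results:

$$\left[\omega^2 - \mu_{s+}^2 k^2 - \omega_{p+}^2 + j\nu_+\omega k^2\right]\left[\omega^2 - \mu_{s-}^2 k^2 - \omega_{p-}^2 + j\nu_-\omega k^2\right] - \omega_{p+}^2\,\omega_{p-}^2 = 0. \tag{6}$$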


In the experiment of Wong and others,1 $\omega^2$ varies from approximately $4 \times 10^9$ to $4 \times 10^{11}$. For $n_0 = 6 \times 10^{10}$/cm$^3$, $\omega_{p-}^2$ will be approximately $1.8 \times 10^{20}$, and, for $m_+/m_- = 10^5$, $\omega_{p+}^2$ will be $1.8 \times 10^{15}$. By using this information, as well as the order of magnitude of the sound velocity and the kinematic viscosity, an approximate equation for the ion waves can be written from Eq. 6.

$$k^2 = \frac{\omega^2}{2\mu_{s+}^2 - j\omega\nu_+}. \tag{7}$$


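Setting $k = k_0 + j\kappa$, where $k_0$ indicates the propagation, and $\kappa$ the damping, Eq. 7 yields

$$k_0 = \frac{\omega}{2\mu_{s+}}\left[\frac{(1+t^2)^{1/2} + 1}{1+t^2}\right]^{1/2}, \qquad \kappa = \frac{\omega}{2\mu_{s+}}\left[\frac{(1+t^2)^{1/2} - 1}{1+t^2}\right]^{1/2}, \tag{8}$$

where

$$t = \frac{\nu_+\omega}{2\mu_{s+}^2}.$$

In Fig. VI-4 we show the quantities $(\nu_+/\mu_{s+})\kappa$ and $(1/2\mu_{s+})\,\omega/k_0 = v_{\text{phase}}/2\mu_{s+}$ as functions of $\nu_+\omega/2\mu_{s+}^2$. It should be mentioned that the phase velocity, from simple Navier-Stokes theory, rises much more rapidly with increasing $\omega$ than higher order theory or experiments indicate, while the damping remains adequately described by the simple theory.2,3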

In Fig. VI-5 we present the first attempt to fit the damping theory to the experimental data (Fig. 2 of Wong et al.1). The data give $\kappa = 0.47$ for potassium at a frequency of $10^5$ cps. At this frequency we choose $\nu_+\omega/2\mu_{s+}^2 = 5$. Since, from Fig. VI-4, $(\nu_+/\mu_{s+})\kappa = 1.96$ at this point, we have $\mu_{s+}/\nu_+ = 0.24$. This guess is seen to give a fairly good representation of the experimentally observed damping.

Fig. VI-4. Theoretical propagation and damping functions.

Fig. VI-5. $\kappa$ as a function of frequency.

The phase velocity obtained from our fitted data is high because for $\mu_{s+}$ itself we obtain $2.6 \times 10^5$ cm/sec, which is approximately the observed phase velocity.1 Our computed phase velocity will vary from $4 \times 10^5$ cm/sec at 10 kc/sec almost linearly to $11 \times 10^5$ cm/sec at 100 kc/sec. The fitted point also gives a value of $\nu_+ \approx 11 \times 10^5$ cm$^2$/sec, which is a correct order of magnitude.
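The fitted values quoted above can be checked directly from the approximate dispersion relation $k^2 = \omega^2/(2\mu_{s+}^2 - j\omega\nu_+)$ (Eq. 7). The sketch below is an illustration rather than the author's procedure: it takes the complex square root numerically instead of reading Fig. VI-4, and it assumes that the damping constant 0.47 is expressed in cm$^{-1}$. Because the curve is evaluated exactly, the normalized damping at the fit point comes out slightly above the graphical value 1.96, so the derived numbers differ from the text in the second significant figure.

```python
import cmath
import math


def wavenumber(omega, mu_s, nu):
    """Complex k = k0 + j*kappa from k^2 = omega^2 / (2 mu_s^2 - j omega nu)."""
    return cmath.sqrt(omega**2 / (2.0 * mu_s**2 - 1j * omega * nu))


def normalized_damping(t):
    """(nu/mu_s)*kappa versus t = nu*omega/(2 mu_s^2); uses mu_s = nu = 1, omega = 2t."""
    return wavenumber(2.0 * t, 1.0, 1.0).imag


# Fit point from the text: kappa = 0.47 at 1e5 cps, with the parameter chosen as 5.
omega_fit = 2.0 * math.pi * 1.0e5
t_fit, kappa_fit = 5.0, 0.47
mu_over_nu = kappa_fit / normalized_damping(t_fit)
mu_s = omega_fit / (2.0 * t_fit * mu_over_nu)   # follows from t = nu*omega/(2 mu_s^2)
nu = mu_s / mu_over_nu


def phase_velocity(freq_cps):
    """omega / k0 for the fitted mu_s and nu, in cm/sec."""
    omega = 2.0 * math.pi * freq_cps
    return omega / wavenumber(omega, mu_s, nu).real


print(f"mu_s/nu = {mu_over_nu:.3f}, mu_s = {mu_s:.2e} cm/sec, nu = {nu:.2e} cm^2/sec")
for freq in (1.0e4, 1.0e5):
    print(f"{freq:.0e} cps: v_phase = {phase_velocity(freq):.2e} cm/sec")
```

Choosing a different value of the parameter at the fit frequency changes `mu_s` and `nu` together, which is why the text describes the choice of 5 as a guess to be judged against the whole damping curve.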

Wong, D'Angelo, and Motley, in their Figure 1, give the experimentally observed phase velocity as a function of frequency.1 These data appear to indicate slightly higher phase velocities for higher frequencies, as well as higher phase velocities for lower densities. Figure VI-4 shows an increasing phase velocity with frequency and also, if the frequency is held fixed, an increasing phase velocity with decreasing density, since $\nu_+ \sim n\lambda_D^4 \sim 1/n$.

For the sake of completeness, we shall compute the viscosity from Boltzmann theory.

$$\nu_+ = \frac{4}{3}\,\frac{\tau kT}{m}. \tag{9}$$

In Eq. 9 T is the temperature, k is Boltzmann's constant, m is the ion mass, and $\tau$ is the time constant given by Spitzer,5

$$\tau = \frac{m^{1/2}(3kT)^{3/2}}{8 \times 0.714\,\pi n e^4 \ln\Lambda}, \tag{10}$$

where $\Lambda = 12\pi^{3/2} n\lambda_D^3$.

For potassium, with $n = 6 \times 10^{10}$/cm$^3$ and T = 2300°K, we obtain

$$\nu_+ = 0.883 \times 10^5\ \text{cm}^2/\text{sec}. \tag{11}$$

Defining $\sqrt{2}\,\mu_{s+}$ as $2.5 \times 10^5$ cm/sec, the measured value of the phase velocity, we obtain at 100 kc/sec,

$$\frac{\nu_+\omega}{(\sqrt{2}\,\mu_{s+})^2} = 0.886. \tag{12}$$
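The parameter in Eq. 12 follows from the quoted viscosity and phase velocity by simple arithmetic, and the same numbers give a damping constant through the approximate relation $k^2 = \omega^2/(2\mu_{s+}^2 - j\omega\nu_+)$. The short sketch below is only a check under the assumption that 100 kc/sec corresponds to $\omega = 2\pi \times 10^5$ rad/sec; the variable names are illustrative.

```python
import cmath
import math

nu_plus = 0.883e5          # kinematic viscosity from Eq. 11, cm^2/sec
v_measured = 2.5e5         # sqrt(2) * mu_s, the measured phase velocity, cm/sec
omega = 2.0 * math.pi * 1.0e5   # 100 kc/sec expressed in rad/sec

# Left-hand side of Eq. 12.
parameter = nu_plus * omega / v_measured**2

# Damping constant at the same frequency; note that 2 mu_s^2 = v_measured^2.
k = cmath.sqrt(omega**2 / (v_measured**2 - 1j * omega * nu_plus))
kappa = k.imag

print(f"nu*omega/(sqrt2 mu_s)^2 = {parameter:.3f}")
print(f"kappa at 100 kc/sec = {kappa:.2f} per cm")
```

With unrounded $\omega$ the parameter comes out close to 0.89, in line with Eq. 12, and the damping constant is of the same order as the measured value of 0.47 cited for this frequency.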


The theoretical curve and the experimental points for $\kappa$, the damping constant, as a function of frequency are shown in Fig. VI-6. It is clear that the viscous theory gives the correct order of magnitude of the damping.

Fig. VI-6. Damping constant as a function of frequency.

Therefore, at present, it seems that a simple viscous theory can explain the observations on ion waves without recourse to Landau damping as discussed by Fried and Gould.6

H. R. Radoski
A. BEAM-PLASMA DISCHARGE

When a strong electron beam is sent through a partially evacuated region, a series of events occurs that have led us to coin the name "beam-plasma discharge." By this we mean a dynamic process in which plasma is produced by electrons that have been accelerated in microwave fields, and the microwaves in turn have been generated by the strong interaction between the beam and the plasma.1 The interaction between the beam and plasma is such that intense microwave oscillations are produced either in the neighborhood of the electron cyclotron frequency or of the electron plasma frequency, $\omega_p$. The oscillations are accompanied by large oscillatory velocities of the plasma electrons. When these velocities become large enough, the plasma electrons will further ionize the ambient gas, thus shifting up $\omega_p$ and the oscillation frequency.

Simultaneous with the ionization process, considerable rf emission and light emission occur and large ion currents appear at the walls of the cylindrical waveguide. Some of the plasma electrons are accelerated to very high energies by the fields in the discharge. This is inferred from the x-rays emitted by the walls and by a molybdenum target, as well as from the large negative floating potential of a probe placed near the beam.


1. Plasma Electron Energy

Four  different  types  of  measurements  have  been  used  to  determine  the  electron 
energy  in  the  plasma. 

(a) An average electron temperature has been calculated from measurements of ion currents leaving the plasma volume, and of the loss of power by the electron beam. In the steady state, the equation of continuity is

$$I_+ = I_- = n_- e\, \nu_i V_p, \tag{1}$$

where $I_+$ is the total ion current, $I_-$ is the total electron current leaving the plasma volume $V_p$, $n_-$ is the plasma density, and $\nu_i$ is the ionization frequency.

The power balance equation is approximately

$$f I_0 V_0 = P_i + I_- T_-, \tag{2}$$

where $f I_0 V_0$ is the fraction of the beam power ($I_0 V_0$) given to the plasma, $P_i$ is the power lost in ionization and excitation processes, $I_- T_-$ is the power lost because of electrons leaving the plasma volume, and $T_-$ is the mean energy of the electrons in volts.

If we neglect excitation losses,

$$P_i = V_i\, n_- e\, \nu_i V_p, \tag{3}$$

where $V_i$ is the ionization potential. With the use of Eqs. 1 and 3, Eq. 2 becomes

$$I_+ (T_- + V_i) = f I_0 V_0. \tag{4}$$

In a typical experiment on system B with helium gas used:

f ≈ 0.3
$V_i$ = 24.6 volts
$I_+$ = 6.6 amps
$I_0$ = 1 amp
$V_0$ = 10,000 volts.

Then

$T_-$ ≈ 430 volts.
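The arithmetic behind this estimate is a one-line rearrangement of Eq. 4, $T_- = f I_0 V_0 / I_+ - V_i$. A minimal sketch, with a function name chosen here purely for illustration:

```python
def mean_electron_energy(fraction, beam_current, beam_voltage, ion_current, ionization_potential):
    """Solve I_+ (T_- + V_i) = f I_0 V_0 for T_-, in volts."""
    return fraction * beam_current * beam_voltage / ion_current - ionization_potential


# Values quoted for the helium experiment on system B.
t_minus = mean_electron_energy(
    fraction=0.3,               # f
    beam_current=1.0,           # I_0, amps
    beam_voltage=10000.0,       # V_0, volts
    ion_current=6.6,            # I_+, amps
    ionization_potential=24.6,  # V_i, volts
)
print(f"T_- = {t_minus:.0f} volts")
```

The result is dominated by the ratio of absorbed beam power to ion current, so the estimate is about as uncertain as the fraction f.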

(b) The presence of high-energy plasma electrons has also been inferred from measurements of x-rays emitted by a molybdenum target. The target (1 in. × 1.5 in. × 0.01 in.) is movable, and experiments were performed with the target far from the beam (Fig. VII-1). At low pressures when a beam-plasma discharge cannot be sustained, no x-rays are observed. In the presence of the discharge x-rays are observed with the target parallel to the beam axis but the intensity is weak. With the target at 45° with respect to the beam axis, in either direction, the x-rays observed are considerably more intense. The results of a specific set of measurements are given in Fig. VII-2. These are for a beam voltage of 10 kv, with the target oriented 45° to the beam axis.

Fig. VII-1. Experimental geometry and schematic of x-ray absorption measurements in system A.

By adjusting pressure and magnetic field for maximum x-ray intensity, we have been able to make absorption experiments with varying thicknesses of aluminum sheet up to 0.180 inch thick. The log of the observed intensities when plotted against the thickness of the absorber gave an average slope that falls between the $K_\beta$ line of molybdenum (19.5 kv) and $K_\alpha$ line of tin (25.1 kv) as shown in Fig. VII-2. (The collimator resolution was such that both the target and the tin-coated wall were visible to the scintillator detector.) The measurement indicates the presence of electrons of approximately 20-kv energy or twice the beam voltage.

(c) The presence of high-energy electrons is also inferred from measurements of probe floating potential. The instantaneous voltage between a Langmuir probe and ground was measured at various positions in the discharge. The probe was grounded through a 1-megohm voltage divider. The time constant was approximately 1 μsec. A typical variation of floating potential with position at a given axial position is shown in Fig. VII-3. Negative floating potentials in excess of 1000 volts were observed when the beam voltage was 5.6 kv; this observation indicates the presence of high-energy electrons.

Fig. VII-2. Plot of normalized x-ray intensity vs thickness of aluminum sheet. Helium gas pressure, 0.2 μ Hg; magnetic mirror, 2:1; central field, 600 gauss; beam voltage, 10 kv.

(d) Another estimate of the average electron energy was made by comparing the intensities of two He I lines. The light was observed just after the beam was turned off. Preliminary measurements indicate an electron temperature of 30-100 ev, but there are several conditions in the experiment which may not satisfy the assumed theoretical conditions.3 Work will continue on this method to determine the validity of these preliminary results.

2. Electron Density

The plasma density ranges from $10^{11}$/cc to $10^{13}$/cc. This was estimated in two ways. First, the highest, strongest emitted frequency (that is not a harmonic of some lower emitted frequency) was assumed to equal $\omega_p$, which is proportional to $n_-^{1/2}$. Second, the density was estimated from the average total charge flowing to the walls after the beam pulse had ended and from an estimate of the plasma volume.2
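The first method rests on the standard relation $\omega_p^2 = n e^2/\epsilon_0 m$ (SI units), which is assumed here since the report does not write it out. The sketch below converts between an emitted frequency and the corresponding density; the function names are illustrative.

```python
import math

E_CHARGE = 1.602e-19   # electron charge, coulombs
M_E = 9.109e-31        # electron mass, kg
EPS0 = 8.854e-12       # permittivity of free space, farads/m


def plasma_frequency_hz(density_per_cc):
    """Electron plasma frequency f_p = omega_p / 2 pi for a density given per cc."""
    n_m3 = density_per_cc * 1.0e6
    return math.sqrt(n_m3 * E_CHARGE**2 / (EPS0 * M_E)) / (2.0 * math.pi)


def density_per_cc(frequency_hz):
    """Density whose plasma frequency equals the given emitted frequency."""
    omega = 2.0 * math.pi * frequency_hz
    return EPS0 * M_E * omega**2 / E_CHARGE**2 * 1.0e-6


for n in (1.0e11, 1.0e12, 1.0e13):
    print(f"n = {n:.0e}/cc -> f_p = {plasma_frequency_hz(n) / 1e9:.1f} Gc/sec")
```

Because the frequency varies only as the square root of the density, the quoted range of two decades in density corresponds to one decade in emitted frequency.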

3. Plasma Ion Energy

In a typical helium beam-plasma discharge, the width of the 4686 Å line was found to be 0.4 Å. If this observed broadening were caused entirely by thermal motion of the ions, the ion temperature would be 5 ev. The plasma ion can acquire energy from the microwave fields, from the dc fields, or through collisional heating. The microwave and dc fields still have to be measured, but the fact that a positive floating potential is frequently observed near the walls (see Fig. VII-3) suggests that ions are expelled from the plasma by a large space potential between the discharge and the walls. This space-charge electric field may have been responsible for the spread of ion velocities inferred from the spectral broadening.

Fig. VII-3. Floating potential vs time at various positions in system A. Hydrogen gas pressure, 0.2 μ Hg; magnetic mirror, 2:1; central field, 600 gauss; beam voltage, 5.6 kv.
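The 5-ev figure can be recovered from the usual Doppler-width formula, $\Delta\lambda/\lambda = (8 \ln 2\; kT/Mc^2)^{1/2}$. The sketch below assumes that the quoted 0.4 Å is the full width at half maximum of a Gaussian profile and that the emitter is singly ionized helium (the 4686 Å line belongs to He II); neither assumption is stated explicitly in the report.

```python
import math

AMU_EV = 931.494e6      # rest energy of one atomic mass unit, ev
HELIUM_MASS_AMU = 4.0026


def doppler_temperature_ev(line_angstrom, fwhm_angstrom, mass_amu):
    """Ion temperature (ev) for which thermal Doppler broadening gives the stated FWHM."""
    rest_energy = mass_amu * AMU_EV
    return rest_energy * (fwhm_angstrom / line_angstrom) ** 2 / (8.0 * math.log(2.0))


ion_temperature = doppler_temperature_ev(4686.0, 0.4, HELIUM_MASS_AMU)
print(f"ion temperature = {ion_temperature:.1f} ev")
```

Since the temperature scales as the square of the width, any instrumental contribution to the 0.4 Å would lower the inferred ion temperature noticeably.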

It can be shown that collisional heating of the ions by the electrons, competing with losses that are due to charge exchange and to loss of ions from the mirror, cannot produce an ion temperature of this magnitude, given our neutral particle density of $7 \times 10^{12}$/cc.

L. D. Smullin, W. D. Getty, B. A. Hartenbaum, H. Y. Hsieh
B. INSTABILITY IN THE HOLLOW-CATHODE DISCHARGE

An instability in the hollow-cathode discharge1 has been observed which may be associated with the diffusion of plasma across the magnetic field away from the arc. We observe a single broad "spoke" of plasma rotating about the axis of the discharge. The density and shape of this cloud have been studied, and its behavior with various arc parameters is being observed. A theoretical study of the problem is being made.

1.  Behavior  of  the  Plasma  Cloud 

Our experimental arrangement is shown in Fig. VII-4. The arc is run on the magnetic axis of the system in argon at a pressure of several times $10^{-4}$ mm Hg. The magnetic field may be varied continuously up to approximately 600 gauss, and arc currents in the range 5-20 amps are used.

Fig. VII-4. Experimental arrangement.

The plasma cloud has been studied with a pair of movable Langmuir probes. The density and shape of the cloud have been approximately determined from ion saturation current and temperature, and the frequency of rotation of the cloud has been plotted against various parameters. We observe that the instability only appears for magnetic fields above a certain value, which is not a strong function of pressure or arc current.

Fig. VII-5. Density contours: b, $2 \times 10^{10}$; c, $4 \times 10^{10}$; d, $8 \times 10^{10}$; e, $1.8 \times 10^{11}$; f, $2.5 \times 10^{11}$. Magnetic field into the page.

Fig. VII-6. Density profiles of disturbance. (Abscissa: distance from arc axis, inches.)

Figure VII-5 is a density contour map, each contour corresponding to a constant density at an instant of time. The whole map rotates counterclockwise at a few kilocycles. It is observed that the cloud does not exhibit any helical pattern, that is, there is no measurable phase shift of the density maximum with distance along the axis.

Figure VII-6 shows density profiles taken along the lines marked $n_{\max}$ and $n_{\min}$ on the contour map. Also, $n_{\max} - n_{\min}$ is plotted on the same axes.

Figure VII-7 illustrates the variations in rotational frequency with magnetic field and with arc current. The frequency is approximately proportional to the square root of magnetic field, and increases with increasing arc current. Figure VII-8 shows that the relation between the period of rotation and the background pressure is nearly linear.


Fig. VII-7. Rotational frequency of disturbances vs magnet current (amperes), with arc current a parameter. Magnetic field on axis, approximately 10 gauss per ampere magnet current.

Fig. VII-8. Effect of background pressure on rotational period. Magnet current, 40 amps; arc current, 10 amps.


QPR  No.  69 




(VII.  PLASMA  ELECTRONICS) 


2.  Theoretical  Considerations 

We postulate that the frequency of rotation of the cloud is such that the azimuthal
velocity of the particles is equal to the $E_r/B_z$ drift velocity. The direction of rotation
agrees with this hypothesis, since the plasma potential is positive with respect to the
vacuum-chamber walls, thereby indicating an outward radial electric field. In a reflex
discharge the plasma is contained between the cathode and a reflector, both between
50 and 80 volts negative with respect to the vacuum-chamber walls. In this case the
sense of rotation of the cloud is observed to be opposite to that in the straight (nonreflex)
arc. The frequency of rotation is approximately that expected for the values of $E_r$ and
$B_z$ in the system, and has no relation to a cyclotron frequency of any type of particle
present. Although we have not yet established the mechanisms for this instability, we are
trying to compare our observations with those predicted for the "flute" instability and
for the helical instability of Kadomtsev and Nedospasov.2
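
The postulate that the cloud rotates at the $E_r/B_z$ drift speed can be turned into a rough frequency estimate. The sketch below is only an illustration: the function name, the field strength of 20 volts/meter and the 1-inch radius are assumed values chosen for the example, not measurements from this experiment.

```python
import math

def exb_rotation_frequency(e_radial, b_axial, radius):
    """Rotation frequency (cycles/sec) of a cloud whose azimuthal speed is E_r/B_z.

    e_radial: radial electric field, volts/meter (assumed value)
    b_axial:  axial magnetic field, webers/meter^2 (1 gauss = 1e-4)
    radius:   distance of the cloud from the arc axis, meters (assumed value)
    """
    drift_speed = e_radial / b_axial              # E x B drift speed, meters/sec
    return drift_speed / (2.0 * math.pi * radius)

# Hypothetical numbers: 400 gauss on axis, 20 volts/meter, cloud 1 inch off axis.
frequency = exb_rotation_frequency(e_radial=20.0, b_axial=0.04, radius=0.0254)
print(f"rotation frequency ~ {frequency / 1e3:.1f} kc")
```

With these assumed inputs the estimate falls in the range of a few kilocycles; a measured radial field profile would be needed for a real comparison.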

D.  L.  Morse 
C. COUPLING OF EMPTY-WAVEGUIDE MODES, ELECTROSTATIC AND MAGNETOSTATIC
MODES IN WAVEGUIDES LOADED WITH GYROTROPIC MEDIA

In this report we present a new approach to the solution of wave propagation in
gyrotropic waveguides. Chorney used an argument based on coupling between an
empty-waveguide mode and a quasi-static mode for the interpretation of a particular
dispersion curve obtained by computations based on the exact determinantal equation.1
Bers suggested the examination of the coupling of empty-waveguide and electrostatic
modes for plasma-loaded waveguides. We now present the details of this approach and
illustrate its use. Recently a similar coupling-of-modes approach was used in the solution
of ferrite and plasma waveguides, as suggested by Auld.2,3 However, the details of that
approach are quite different from ours, as will be pointed out here.

1.  Uncoupled  Modes 

Consider  a  gyrotropic  medium  characterized  by  dielectric  and  magnetic  permittivity 
tensors 
We separate E and H into a rotational and an irrotational part:

    $\mathbf{E} = \mathbf{E}_r - \nabla\Phi$    (3a)

    $\mathbf{H} = \mathbf{H}_r - \nabla\Psi$    (3b)

We expand $\mathbf{E}_r$ and $\mathbf{H}_r$ into series of empty-waveguide modes, and $\Phi$ and $\Psi$ into
series of electrostatic and magnetostatic modes. We then substitute these expansions in
Maxwell's equations for the gyrotropic medium. The coupling of these modes because of
the gyrotropic nature of the medium will then be established with the aid of the
orthogonality relations for the individual modes. The orthogonality relations for
electrostatic modes are


IKI 


*i*m  da=  *£m 


(4a) 


fvL^T-Vfdas{lm-  <4b> 

and  for  the  magnetostatic  modes  are 

Yml  Vm*Ida-#mi  (5a) 

jv^Ur-V-r^da^.  (5b) 


The main difference from Auld and Eidson's treatment lies in the characterization of
each quasi-static mode, not by its propagation constant ($\nu_m$) but by other variables
(plasma frequency in the case of plasma, radian frequency in the case of ferrite).


2.  Coupled-Mode  Theory 
We  define 
Substituting Eqs. 6-8 in Maxwell's equations and applying the orthogonality relations,
we obtain

    $\gamma(V + aF) = bI + cM$    (9a)

    $\gamma(I + dM) = eV + fF$    (9b)

_  ft  _  vrt  - 
F  =  t1—  V  +  I 

l“€o  j“eo 

(10) 

_  rt  -  v3t_ 

M  =  -^-  I  +  J5.  v. 
j«p  3^r 

(11) 

Here,  the  dagger  means  complex  conjugate  of  the  transpose  of  a  matrix,  and 


ai£  =  Pei  j 

*  ^ 

da 

(12a) 

bin  ‘  j«*>C0 

penpei  I  K„  da  +  **  oj  ^Ti  ‘  LT  ‘  hTn  da 

(12b) 

cim  =  ^o 

hT.  •  LT  •  (-VT*m)  da 

(12c) 

dim  =  phi  . 

U-*m da 

( 1 2d) 

* 

ein  "  jwpQ  phiphn  I  L|(  da  +  J“€o  j)  eTi  KT  eTn  da 

(l2e) 

*-h 

K. 

II 

t-i. 

e 

o 

y  ^Ti  ’  ^T  '  da" 

(I2f) 

Lower-case  letters  for  potentials  denote  the  empty-waveguide  modes;  and  upper-case 
letters,  the  quasi-static  modes.  The  symbol  p  denotes  transverse  wave  numbers. 

3.  Discussion 

If we consider all of the empty-waveguide modes, the quasi-static modes (Eqs. 10
and 11) are eliminated, and we get the equations of Chorney.1 This is to be expected,
since the empty-waveguide modes form a complete set. When we do not consider the
complete set of empty-waveguide modes, the first two of Maxwell's equations are
satisfied only approximately, and the divergence equations must be satisfied independently;
hence Eqs. 10 and 11 are then independent of Eqs. 9a and 9b.

In  the  case  of  a  waveguide  completely  filled  by  a  dielectric  medium  each  electrostatic 
potential  is  related  to  only  one  transverse  electromagnetic  potential  (9.  =  —  -  -  --  <{>  -V 

r  ^  J 


Therefore the quasi-static modes are eliminated in our treatment. This does not happen
for ferrite-loaded waveguides. There is coupling of each magnetostatic potential to more
than one transverse electromagnetic potential, which is due to the boundary conditions.

Our treatment is, however, most useful for partially filled waveguides. We assume
that the characteristic tensors are different from 1 in the space occupied by the
gyrotropic medium, and equal to 1 in the free space outside the medium. The formulation
given here allows for the tensors to be functions of the transverse coordinates, and hence
the results also apply directly to the partially filled waveguide.


4.  Example 

For a completely filled circular cylindrical plasma waveguide of radius $a$ we examine
the coupling among the TE$_{10}$ mode, TM$_{10}$ mode, and the Q.S.$_1$ mode. There exists
a magnetostatic field along the axis of the waveguide. The propagation constant is
determined by the determinant of Eqs. 9 and 10. The dominant term in the expansion of this
determinant contains the three assumed modes and leads to the following approximate
expression for the propagation constant:


4 


?K1  2 

2 

2 

1'  ,  „  -  x  2/„  2  2  \ 

IW/el 

~  Ph  1  y 

+  3'2  K^u  (K||u  ”pel) 

1/2 


(13) 


The frequencies are normalized to $\omega_0 = c/a$ ($u = \omega/\omega_0$), and the propagation
constants to $1/a$.

Equation 11 gives the following expressions for cutoffs ($u_{ec}$; $u_{hc}$):


    $u_{ec}^2 - u_p^2 = p_{e1}^2$    (E-cutoffs)    (14a)

    $u_{hc}^2\,\dfrac{K_\perp^2 - 0.8\,K_\times^2}{K_\perp} = p_{h1}^2$    (H-cutoffs).    (14b)


The only difference from the exact expressions is the numerical coefficient 0.8
that appears in Eq. 14b.

For the resonances we find
Cyclotron resonances ($u = u_{ci}$ or $u_{ce}$)


Y  2  pel 

K_ - *-1-9ua+^U  05) 

4 

In the exact solution we have 2.0 instead of 1.9.
Plasma resonance ($u = u_p$)

V2K„ - >  p2^  (16) 

4 

which  is  identical  with  the  exact  solution. 

These  expressions  include  the  change  from  forward  waves  to  backward  waves  at 
cyclotron  resonance. 

P.  E.  Serafim,  A.  Bers 
D. ION OSCILLATIONS IN NEUTRALIZED, THIN ELECTRON BEAMS

In  this  report  we  present  a  possible  theoretical  explanation  of  the  mechanism 
for  inducing  ion  oscillations  in  thin  electron  beams.  The  instability  is  of  a  "hose" 
type  (n  =  ±1)  in  which  the  ac  velocity  of  the  beam  is  largely  in  the  transverse 
direction. 

Consider a thin cylindrical electron beam of radius $b$ partially or fully neutralized
by an ion background of the same radius. It is assumed that there are no stationary
electrons. The beam moves with dc velocity $v_0$ parallel to a magnetic field $B_0$, both
in the z direction. We assume that the metal drift-tube radius is much larger than $b$.

For such a thin beam, a quasi-static treatment is justified when the beam is
nonrelativistic. We consider the angular dependences $e^{jn\phi}$ with $n = \pm 1$, and look for wave
solutions of the form $e^{j(\omega t - \beta z)}$. If we take the small-argument limit of the appropriate
Bessel functions, the potentials in the vicinity of the beam are

    $\Phi = K\,\dfrac{r}{b}\,e^{\pm j\phi}$    inside

    $\Phi = K\,\dfrac{b}{r}\,e^{\pm j\phi}$    outside.    (1)


If we equate the discontinuity in the normal electric field to the surface charge, we
obtain the following dispersion equation

    $\dfrac{\omega_{pb}^2}{\omega_r(\omega_r \pm \omega_{ce})} + \dfrac{\omega_{pi}^2}{\omega(\omega \mp \omega_{ci})} = 2,$    (2)

where $\omega_r = \omega - \beta v_0$, $\omega_{pb}$ is the beam-plasma frequency, $\omega_{pi}$ is the ion-plasma
frequency, $\omega_{ce}$ is the electron-cyclotron frequency, and $\omega_{ci}$ is the ion-cyclotron
frequency. The beam waves in the absence of the ions are shown in Fig. VII-9. We see that
for $\omega_{pb} \ll \omega_{ce}$, we have the usual cyclotron and synchronous waves of a filamentary beam.
The dispersion in the presence of the ions is sketched in Fig. VII-10 for $\omega_{pi} \gg \omega_{ci}$. We
see that there are convective instabilities for both $n = \pm 1$ which have an infinite rate of
growth in space for $\omega = \omega_{pi}/\sqrt{2}$. This instability can be interpreted as a coupling of
the "generalized" cyclotron and synchronous waves because of the ions. It is a type of
reactive-medium amplification in the sense that the ion "oscillators" appear inductive
just below the resonant frequency $\omega_{pi}/\sqrt{2}$.

Fig. VII-9. Electron-beam waves in the absence of ions.
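
The onset of complex roots just below $\omega_{pi}/\sqrt{2}$ can be checked numerically. The sketch below assumes that the thin-beam dispersion relation has the form $\omega_{pb}^2/[\omega_r(\omega_r + s\,\omega_{ce})] + \omega_{pi}^2/[\omega(\omega - s\,\omega_{ci})] = 2$ with $s = \pm 1$; this form, the function name, and the normalized frequencies used are assumptions made for illustration only.

```python
import cmath

def doppler_shifted_roots(omega, omega_pb, omega_pi, omega_ce, omega_ci, sign=+1):
    """Solve the assumed thin-beam relation for w_r = omega - beta*v0.

    Assumed form: w_pb^2/(w_r (w_r + s w_ce)) + w_pi^2/(w (w - s w_ci)) = 2,
    where s = +1 or -1 picks the upper or lower sign.
    Returns both roots as complex numbers.
    """
    s = 1.0 if sign >= 0 else -1.0
    ion_term = omega_pi**2 / (omega * (omega - s * omega_ci))
    g = 2.0 - ion_term                      # value the beam term must take
    if g == 0.0:
        raise ZeroDivisionError("ion resonance: the roots go to infinity")
    # Quadratic in w_r: g*w_r^2 + s*g*w_ce*w_r - w_pb^2 = 0
    root = cmath.sqrt((g * omega_ce) ** 2 + 4.0 * g * omega_pb**2)
    return ((-s * g * omega_ce + root) / (2.0 * g),
            (-s * g * omega_ce - root) / (2.0 * g))

# Hypothetical normalized frequencies: w_pi = 1, w_pb = 1, w_ce = 2, w_ci = 0.001.
below = doppler_shifted_roots(0.69, 1.0, 1.0, 2.0, 1e-3)   # just below w_pi/sqrt(2)
above = doppler_shifted_roots(1.00, 1.0, 1.0, 2.0, 1e-3)   # above w_pi/sqrt(2)
```

Since $\beta = (\omega - \omega_r)/v_0$, a nonzero imaginary part of $\omega_r$ at real $\omega$ corresponds to growth or decay in space; the roots grow without bound as the ion term approaches 2.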
Fig. VII-10. Electron beam-ion interaction.

(a) n = +1; (b) n = -1.

We should note that the infinite gain violates the assumption of $\beta b \ll 1$; however, a
gain that is comparable to $1/b$ is extremely large.

R. J. Briggs


E. ELECTRON-CYCLOTRON PLASMA HEATING

Investigations on the production of hot dense plasmas by microwaves at the
electron-cyclotron frequency (Quarterly Progress Report No. 64, pages 103-104, and No. 65,
pages 95-97) have been resumed. An experiment at medium power is being carried
out (Fig. VII-11); it utilizes an RK 62 pulsed magnetron at λ = 10 cm, and a cylindrical
cavity with concave end plates. This cavity is made of copper-plated stainless steel
and has a volume of approximately 25 liters, so that it has a high mode density in the
neighborhood of 3000 mc. Preliminary experiments were made with 1-μsec pulses at
a 500/sec repetition rate, and available peak power of 50-60 kw produced visible
discharges at air pressures of approximately 10~* mm Hg, with x-ray intensity of tens of
kilovolts.

Fig. VII-11. Cavity and waveguide assembly.

G.  Fiocco,  L.  D.  Smullin 


F.  ELECTRON-BEAM  INTERACTIONS  WITH  IONS  IN  A  WARM  PLASMA 


A  theoretical  study  is  being  made  of  the  various  beam-plasma  instabilities  in  a  warm 
plasma  in  order  to  determine  the  feasibility  of  heating  ions  directly  from  a  collective 
type  of  interaction.  In  this  report  we  summarize  some  results  of  a  one-dimensional 
analysis  of  the  problem. 

We consider the one-dimensional system of an unbounded electron beam moving
with velocity $v_0$ through an unbounded plasma parallel to an applied steady magnetic
field $B_0$ (both taken to be in the z direction). The waves along the magnetic field have
the dependence $\exp[j(\omega t - kz)]$; these waves split into the longitudinal and transverse
waves, which are uncoupled and therefore can be considered separately.

1. Longitudinal Interaction

The longitudinal waves have only a longitudinal component of the electric field and
zero magnetic field. The dispersion equation can be written

    $\dfrac{\omega_{pb}^2}{(\omega - kv_0)^2} = K_\parallel(\omega, k),$    (1)

where $\omega_{pb}$ is the beam-plasma frequency and $K_\parallel$ is the longitudinal dielectric constant
of the plasma. From a collisionless kinetic treatment, assuming cold ions, one obtains1

    $K_\parallel = 1 - \dfrac{\omega_{pi}^2}{\omega^2} - \omega_{pe}^2 \displaystyle\int_{-\infty}^{\infty} \dfrac{f_0(v_z)\,dv_z}{(\omega - kv_z)^2},$    (2)

where $f_0$ is the unperturbed velocity distribution and $\omega_{pi}$ and $\omega_{pe}$ are the ion- and
electron-plasma frequencies, respectively. From a transport equation model, assuming
an isotropic pressure, one has1

    $K_\parallel = 1 - \dfrac{\omega_{pi}^2}{\omega^2} - \dfrac{\omega_{pe}^2}{\omega^2 - k^2 V_{Te}^2},$    (3)

where $V_{Te}$ is the average thermal velocity of the plasma electrons. It can be shown
that the two formulations give identical results when $(\omega/k)$ is either much larger or much
smaller than $V_{Te}$, even though they are derived from quite different physical
assumptions.

a.  Weak  Beam 

For a beam density that is much less than the plasma density, to solve for the beam
waves we can set $k = (\omega/v_0) + \delta$ and assume that $\delta \ll \omega/v_0$. Therefore,

A necessary, but not sufficient, condition for the validity of this approximation is that
$(n_b/n_p)(T_e/2V_0) \ll 1$, where $n_b$ and $n_p$ are the beam and plasma densities, $T_e$ is the
electron temperature, and $V_0$ is the voltage corresponding to the beam velocity $v_0$. For
$T_e/V_0 \ll 1$, much stronger conditions on $n_b/n_p$ are required.

For the transport equation model, Eq. 3, we see that a convective instability is
obtained in the regions in which $[K_\parallel]_{\omega=kv_0}$ is negative, and that the maximum spatial
growth rate occurs near the synchronism of the plasma wave with the beam
($[K_\parallel]_{\omega=kv_0} = 0$). The mechanism here is the well-known reactive-medium amplification:
when a bunched electron beam passes through a medium with a negative dielectric constant
(an "inductive medium"), the electrons in a bunch attract rather than repel and hence the
bunching is further enhanced. In Fig. VII-12 the regions in which $K_\parallel < 0$ are plotted
in the $\omega$-$k$ plane. We see that for $2V_0 > T_e$ the maximum amplification occurs above


8K 


8k 


lu)-kv_ 


(4) 


k 


Fig.  VII-12.  Regions  in  which  K  ||  <  0. 


$\omega_{pe}$. This amplification has been considered by a number of authors.2 For
$(m/M)T_e < 2V_0 < T_e$, there is no instability. For $2V_0 < (m/M)T_e$, there is again
amplification, which now is a maximum at a frequency less than $\omega_{pi}$; the growth rate in
space is
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Im  k  = 


jl/2  “pi 
24/3  V0 


2V  \1/2 
M  £Vo\ 

’“Vj 


(5) 


which is less than $\omega_{pi}/v_0$ for the conditions assumed here.
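
The three regimes quoted above for the transport-equation model can be checked by evaluating Eq. 3 at synchronism, $k = \omega/v_0$. The sketch below assumes $v_0^2/V_{Te}^2 = 2V_0/T_e$ (temperature expressed in volts), a proton plasma, and frequencies normalized to $\omega_{pe}$; the function names are illustrative only.

```python
import math

def k_parallel_at_synchronism(omega, omega_pe, omega_pi, two_v0_over_te):
    """Transport-model K_par evaluated at k = omega/v0.

    two_v0_over_te is 2*V0/Te, taken equal to v0^2/V_Te^2 (an assumption);
    it must not equal 1, where the expression is singular.
    """
    thermal_factor = 1.0 - 1.0 / two_v0_over_te        # 1 - V_Te^2/v0^2
    return (1.0 - (omega_pi / omega) ** 2
            - omega_pe**2 / (omega**2 * thermal_factor))

def upper_edge_of_negative_band(omega_pe, omega_pi, two_v0_over_te):
    """Highest frequency with K_par < 0 at synchronism, or None if there is none."""
    thermal_factor = 1.0 - 1.0 / two_v0_over_te
    edge_squared = omega_pi**2 + omega_pe**2 / thermal_factor
    return math.sqrt(edge_squared) if edge_squared > 0.0 else None

OMEGA_PE = 1.0
OMEGA_PI = OMEGA_PE / math.sqrt(1836.0)    # proton plasma, M/m = 1836

fast_beam = upper_edge_of_negative_band(OMEGA_PE, OMEGA_PI, 4.0)      # 2V0 > Te
middle = upper_edge_of_negative_band(OMEGA_PE, OMEGA_PI, 0.5)         # (m/M)Te < 2V0 < Te
slow_beam = upper_edge_of_negative_band(OMEGA_PE, OMEGA_PI, 1.0e-4)   # 2V0 < (m/M)Te
```

For the fast beam the band of negative $K_\parallel$ extends above $\omega_{pe}$, for the intermediate case there is no such band, and for the slow beam the band lies below $\omega_{pi}$, in line with the statements in the text.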

The collisionless kinetic treatment, Eq. 2, gives similar results except in the region
in which $2V_0$ is comparable to $T_e$. The dielectric constant $[K_\parallel]_{\omega=kv_0}$ is now complex;
it can be shown that this fact leads to a convective instability even in the region in which
$2V_0 < T_e$. The mechanism for the instability is now of a "resistive-medium" type;3 the
novel feature in the present analysis is that the loss in the medium comes about because
of the Landau damping. This amplification is plotted in Fig. VII-13 for a Maxwellian
distribution of velocities. The frequency of maximum gain is of the order of
$(2V_0/T_e)^{1/2}\,\omega_{pe}$ for $2V_0 < T_e$; however, the peak of the gain decreases rapidly if
$2V_0/T_e$ is decreased, since the Landau damping is then much less.

Fig. VII-13. Resistive-medium amplification.

b. Strong Beam

If the beam density is large enough so that $(n_b/n_p)(T_e/2V_0) > 1$, a very interesting
transition takes place. This transition can be anticipated by looking at $k \to \infty$ as
$\omega \to \omega_{pi}$:
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Fig.  VII-15.  (a)  Longitudinal  dispersion  n^/np  =  10' 
(b)  Detail  at  low  frequency. 
    $k^2 \approx \dfrac{\omega_{pe}^2/V_{Te}^2 - \omega_{pb}^2/v_0^2}{\omega_{pi}^2/\omega^2 - 1}.$    (6)

We see that for $(n_b/n_p)(T_e/2V_0) > 1$, real $k$ with $k \to \infty$ is obtained for $\omega > \omega_{pi}$
rather than $\omega < \omega_{pi}$. In Fig. VII-14, the complete dispersion is presented with the pressure
formulation used for simplicity and with a proton plasma having the system parameters
$n_b/n_p = 10^{-2}$ and $(n_b/n_p)(T_e/2V_0) = 0.1$. In Fig. VII-14, $\omega$ is real for all real $k$ and the
complex roots of $k$ represent evanescent waves. In Fig. VII-15 the dispersion is plotted
for $(n_b/n_p)(T_e/2V_0) = 4$; we see that now complex $\omega$ for real $k$ is obtained (by counting
the real roots), and the complex roots of $k$ occurring for $\omega < \omega_{pi}$ now represent a
convective instability with the spatial growth rate tending to infinity at $\omega = \omega_{pi}$.

If $(n_b/n_p)(T_e/2V_0)$ is only slightly less than one, there is still a convective
instability but the spatial rate of growth is no longer infinite. This transition is illustrated
in Figs. VII-16 and VII-17, in which the low-frequency dispersion is plotted for
$(n_b/n_p)(T_e/2V_0) = 0.5$ and 0.25. We see that the instability has disappeared in the latter
case (Fig. VII-17).

The condition for infinite gain at $\omega_{pi}$ can also be written as $\lambda_{pb} < \lambda_D$, where
$\lambda_{pb} = 2\pi v_0/\omega_{pb}$ is the space-charge wavelength and $\lambda_D = 2\pi V_{Te}/\omega_{pe}$ is the Debye
length in the plasma. This form is physically appealing, since it suggests that the "beam
wavelength" should fit inside a Debye sphere in order to have strong interaction with the ions.
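
The equivalence between $\lambda_{pb} < \lambda_D$ and $(n_b/n_p)(T_e/2V_0) > 1$ follows from $(\lambda_D/\lambda_{pb})^2 = (V_{Te}^2/v_0^2)(\omega_{pb}^2/\omega_{pe}^2)$. The short sketch below checks this numerically; it assumes $V_{Te}^2/v_0^2 = T_e/2V_0$ and $\omega_{pb}^2/\omega_{pe}^2 = n_b/n_p$, and the numerical values are arbitrary illustrative choices in normalized units.

```python
import math

def space_charge_wavelength(v0, omega_pb):
    """lambda_pb = 2*pi*v0/omega_pb."""
    return 2.0 * math.pi * v0 / omega_pb

def debye_wavelength(v_te, omega_pe):
    """lambda_D = 2*pi*V_Te/omega_pe."""
    return 2.0 * math.pi * v_te / omega_pe

def strong_beam_parameter(nb_over_np, te_over_2v0):
    """(n_b/n_p)(T_e/2V_0); values above 1 mark the strong-beam regime."""
    return nb_over_np * te_over_2v0

# Hypothetical normalized case: n_b/n_p = 0.04, T_e/2V_0 = 100.
nb_over_np, te_over_2v0 = 0.04, 100.0
v0, omega_pe = 1.0, 1.0
v_te = v0 * math.sqrt(te_over_2v0)            # assumes V_Te^2/v0^2 = T_e/2V_0
omega_pb = omega_pe * math.sqrt(nb_over_np)   # same particle mass for beam and plasma electrons

lam_pb = space_charge_wavelength(v0, omega_pb)
lam_d = debye_wavelength(v_te, omega_pe)
```

In this example the parameter equals 4 and the space-charge wavelength is half the Debye length, so both forms of the condition agree.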


2.  Transverse  Interactions 


The dispersion equation for the transverse waves splits into separate equations
for right and left circularly polarized waves. From a collisionless kinetic treatment,
assuming an isotropic equilibrium velocity distribution, one obtains

    $\dfrac{k^2c^2}{\omega^2} = 1 - \displaystyle\sum \dfrac{\omega_p^2}{\omega} \int \dfrac{f_0(\mathbf{v})\,d^3v}{\omega - kv_z \mp \omega_c} - \dfrac{\omega_{pb}^2(\omega - kv_0)}{\omega^2(\omega - kv_0 \pm \omega_{ce})},$    (7)


where the sum is over all species of the plasma, and $\omega_{ce}$ and $\omega_{ci}$ are the electron- and
ion-cyclotron frequencies, respectively. For a relativistic beam, the relativistic
(transverse) mass should be used in computing $\omega_{pb}$ and the $\omega_{ce}$ of the beam, that is,

    $m = \dfrac{m_0}{(1 - v_0^2/c^2)^{1/2}},$    (8)

where $m_0$ is the rest mass. It should also be noted that the inclusion of temperature
by means of the transport equations with an isotropic pressure would predict no effect
of temperature, since there is no bunching associated with the transverse waves.


a.  Cold  Plasma 


The transverse wave interactions in a cold plasma were considered in Quarterly
Progress Report No. 6? (pages 80-83). In that report, the conditions for charge
neutrality were not carefully considered; consideration of these conditions here leads to a
slight change in the very low frequency nonconvective instability. For a cold plasma,
Eq. 7 becomes

    $\dfrac{k^2c^2}{\omega^2} = 1 - \dfrac{\omega_{pe}^2}{\omega(\omega \pm \omega_{ce})} - \dfrac{\omega_{pi}^2}{\omega(\omega \mp \omega_{ci})} - \dfrac{\omega_{pb}^2(\omega - kv_0)}{\omega^2(\omega - kv_0 \pm \omega_{ce})}.$    (9)


Requiring that the ion density in the plasma just equal the beam density plus the plasma
electron density, and assuming that $\omega \ll \omega_{ci}$ and $kv_0 \ll \omega_{ce}$, for the left polarized wave
we obtain for Eq. 9

    $\omega^2 = k(k - k_0)\,u_\alpha^2,$    (10)

where $u_\alpha$ is the Alfvén speed in the plasma, $k_0 = \mu_0 J_{0b}/B_0$, $J_{0b}$ is the beam current
density, and we have assumed that $u_\alpha \ll c$. This expression is a valid approximation
for $k \lesssim k_0$ if $k_0 \ll \omega_{ce}/v_0$, which requires that $(\omega_{pb}^2/\omega_{ce}^2)(v_0^2/c^2) \ll 1$. This dispersion
is plotted in Fig. VII-18; we see that complex $\omega$ is obtained for real $k$ between zero
and $k_0$, whereas $k$ is real for all real $\omega$ in this region; these results indicate a
nonconvective instability. It can be shown that for such an instability to be excited, the
length of the system must exceed a certain critical "starting length." For perfect
electric conductors terminating the system, the shortest starting length is given by


    $L_{start} = \dfrac{2\pi}{k_0} = \dfrac{2\pi B_0}{\mu_0 J_{0b}}.$    (11)


It is interesting to note also that this instability corresponds to a pure exponential growth
in time of an initial spatial pattern of particle velocities.

An analysis of the right polarized wave shows that a similar nonconvective
instability is obtained, only with $-k_0$ appearing in Eq. 10. It is easily shown that this
represents exactly the same physical phenomenon.
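
The low-frequency relation $\omega^2 = k(k - k_0)u_\alpha^2$ gives imaginary $\omega$ for $0 < k < k_0$, with the largest growth rate $u_\alpha k_0/2$ at $k = k_0/2$. The sketch below evaluates these quantities in mks units; the beam current density, ion density and field used are hypothetical values chosen for illustration, and the Alfvén speed is taken as $B_0/(\mu_0 n_i M)^{1/2}$.

```python
import math

MU_0 = 4.0e-7 * math.pi    # permeability of free space, henrys/meter

def critical_wave_number(beam_current_density, b0):
    """k0 = mu0 * J_0b / B0 (1/meter)."""
    return MU_0 * beam_current_density / b0

def alfven_speed(b0, ion_density, ion_mass):
    """u_alpha = B0 / sqrt(mu0 * n_i * M), valid for u_alpha << c."""
    return b0 / math.sqrt(MU_0 * ion_density * ion_mass)

def omega_squared(k, k0, u_alpha):
    """Right-hand side of the low-frequency relation; negative means growth in time."""
    return u_alpha**2 * k * (k - k0)

def peak_growth_rate(k0, u_alpha):
    """Largest Im(omega), reached at k = k0/2."""
    return 0.5 * u_alpha * k0

# Hypothetical case: 1000 gauss, 1e18 protons/m^3, beam current density 1e4 amp/m^2.
k0 = critical_wave_number(beam_current_density=1.0e4, b0=0.1)
u_alpha = alfven_speed(b0=0.1, ion_density=1.0e18, ion_mass=1.6726e-27)
growth = peak_growth_rate(k0, u_alpha)
```

Wave numbers above $k_0$ give positive $\omega^2$ and hence stable propagation.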

b.  Warm  Plasma 


Fig. VII-18. Low-frequency transverse instability (left polarized).

It can be shown that the electron temperature has essentially no effect on the
convective instability near $\omega_{ci}$, as would be expected physically. From Eq. 7 we see that
for the ion temperature to be neglected it is necessary that $kV_{Ti} \ll \omega_{ci} - \omega$ near the
intersection of the plasma wave and the beam cyclotron wave, where $V_{Ti}$ is the average
thermal velocity of the ions. This requires that


mv 

o 


(12) 


(As an example, for protons of density $10^{1?}$/cc, magnetic field of 1000 gauss, and
$v_0/c = 0.2$, Eq. 12 requires that the ion temperature be less than $10^{-7}$ volts, or much
less than 1°K.) We conclude that for most cases of practical interest the ion temperature
cannot be neglected in discussing the interaction at the ion-cyclotron frequency.

One finds, however, that there is still a convective instability near $\omega = \omega_{ci}$ when the
ion temperature is included in Eq. 7. This interaction is again of a "resistive-medium"
type, the physical origin of the "loss" mechanism now being the cyclotron damping in
the plasma. For reasonable ion temperatures, such that the inequality of Eq. 12 is
reversed, the maximum growth rate in space is


Im  k  = 


2  2 
0)  .01  , 
pi  pb 


(13) 


(For the example cited below Eq. 12, by assuming a beam density of $10^{10}$/cc and ions of
0.1 ev, this growth rate is of the order of $10^{-5}$/cm, which is a very weak amplification.)

The effect of temperature on the nonconvective instability, Eq. 10, is quite small,
as would be expected, since the frequency is far from resonance. To neglect temperature
effects, it is sufficient to require (see Eq. 7) that $kV_{Te}/\omega_{ce}$ and $kV_{Ti}/\omega_{ci}$ be much less
than one for the maximum wave number that must be considered. This requires that


(As  an  example,  for  protons  in  a  magnetic  field  of  1000  gauss,  beam  density  of  101  /cc, 

and $V_0 = 10^4$ volts, inequalities (14) and (15) are satisfied for ion temperatures less
than $4 \times 10^5$ volts and electron temperatures less than $8 \times 10^8$ volts.)

It might be expected that collisions would have an important effect on this
nonconvective instability, since $\omega$ is so small; however, it can be shown that the starting
length is not altered if a relaxation type of collision frequency is included, although the
peak growth rate in time is reduced.

R.  J.  Briggs,  A.  Bers 
G. SCATTERING OF LIGHT FROM (PLASMA) ELECTRONS III

By  using  slight  improvements  in  the  techniques  developed  in  a  previously  reported 
electron-beam  experiment,  '  we  have  observed  scattered  optical  radiation  from  the 
electrons in a dc thermal plasma. Development of these techniques will provide a
powerful plasma diagnostic tool, since it is possible to obtain detailed measurements of
the velocity distribution and density of the plasma electrons. These measurements
can be made with very high spatial resolution (~1 mm$^3$); also, time resolution of these
quantities should not present any great problems. Furthermore, this information is
obtained without the introduction of probes, or, as is the case in some spectroscopic
measurements, any impurity gases.

Fig. VII-19. Diagram used in derivation of spectrum of scattered radiation.

The derivation of the expected spectrum is made as follows. Consider an electron
in velocity space at A with velocity equal to $v$ (see Fig. VII-19); then this electron
will "see" an input wavelength from the laser which is Doppler-shifted by an amount,
$\Delta\lambda_1 = \lambda_0 - \lambda_1$, and is obtained from the well-known formula

($\lambda_0$ is the input wavelength and $\lambda$ is the wavelength observed by an observer moving with
velocity $v$ relative to the source of $\lambda_0$.)

This radiation is observed along the direction shown, and hence the observer will
see radiation of wavelength $\lambda_1$ further Doppler-shifted by an amount depending on the
component of velocity in the line of the detector. Let $\lambda_2$ be the wavelength of observed
radiation.

Similarly, an electron at B with velocity equal to $v^*$ will be observed to radiate
at a wavelength $\lambda_2$ in the laboratory coordinates. We see, therefore, that all electrons
whose velocity vector terminates on the extension of AB will be observed to give
radiation shifted from the input wavelength by an amount $\Delta\lambda_2 = \lambda_0 - \lambda_2$.

The total number of such electrons is simply the number lying in that slice of
velocity space $dV_a$ thick and $V_a$ away from the origin and is given by

    $dN_{V_a} = N f(V_a)\,dV_a,$

where $N$ is the total number of electrons, and $f$ is the distribution function. For a
Maxwell-Boltzmann distribution of electron velocities we have

    $dN_{V_a} = \dfrac{N}{\sqrt{\pi}} \left(\dfrac{m}{2kT_e}\right)^{1/2} \exp\left(-\dfrac{mV_a^2}{2kT_e}\right) dV_a,$

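which, in turn, can be written in terms of the wavelength shift $\Delta\lambda_2$ because

    $V_a^2 = \dfrac{V^{*2}}{2}\,(1 + \cos\theta)$

and

    $V^* = \dfrac{\Delta\lambda_2}{\lambda_0}\,c.$

Hence

    $dN_{V_a} = \dfrac{N}{\sqrt{2}\,\sqrt{\pi}} \left(\dfrac{m}{2kT_e}\right)^{1/2} \dfrac{c}{\lambda_0}\,(1 + \cos\theta)^{1/2} \exp\left[-\dfrac{mc^2 (1 + \cos\theta)\,\Delta\lambda_2^2}{4k\lambda_0^2 T_e}\right] d\lambda,$

where $d\lambda$ is the width of the filter used to select wavelength shift $\Delta\lambda_2$. For $\theta = 90°$ (the
value used in our experiment) the expression simply reduces to

    $dN_{V_a} = A \exp\left(-\chi\,\Delta\lambda_2^2\right),$

where

    $A = \dfrac{N\,d\lambda}{2\sqrt{\pi}\,\lambda_0} \left(\dfrac{mc^2}{kT_e}\right)^{1/2}$

and

    $\chi = \dfrac{mc^2}{4k\lambda_0^2 T_e}.$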

Hence a semi-log plot of signal amplitude versus $\Delta\lambda_2^2$ will give $T_e$ for a
Maxwell-Boltzmann distribution. The intensity of scattered radiation $dI_s$ of wavelength $\lambda_2$ is

,  2 


dl 


~[z?J  •*"  ♦d07ifW  s“pr'TT/ 


dX 


for plane polarized input, of intensity $I_0$ in an area $a$. The angle $\phi$ is the angle between
the plane of polarization and the direction of observation, and $V$ is the volume in the
field of view. Therefore, by making an absolute measurement of the intensity of
scattered radiation, we can also determine the electron density $n_e$ in the plasma.
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Fig.  VII-20.  Schematic  diagram  of  apparatus.  Plan  view. 


FILTER 


Fig.  VII-21.  Cross-section  view  of  apparatus. 


A  schematic  diagram  of  the  apparatus  is  shown  in  Fig.  VII-20  and  a  cross  section 
through  the  plasma-laser  interaction  region  is  shown  in  Fig.  VII-21.  The  plasma  source 
is  the  hollow- cathode  discharge  developed  in  the  Research  Laboratory  of  Electronics, 
a  full  description  of  which  has  been  given.  *  By  using  this  device,  an  argon  plasma  with 
an  electron  density  of  ~10  electrons  cm  and  electron  temperature  of  a  few  electron 
volts is produced along the axis of the machine. Twenty joules of light at 6934 Å from
a ruby laser is passed through the plasma column before being absorbed in the specially
designed dump chamber, the principle of which is evident from the diagram. A similar
dump forms a black background for the receiving lens whose field of view is limited
by the iris. The scattered radiation is collimated before reaching the tunable
interference filter that has a 5 Å passband, the center of which is continuously variable from
6940 Å to 6700 Å. The detector is an EMI 9558 A photomultiplier.

WAVELENGTH  A 

Fig.  VII-22.  Results  of  series  3  experiment  (argon  plasma  stream). 


The  results  of  our  first  experiment  (series  3),  in  which  we  obtained  the  spectrum 
of  the  scattered  radiation,  are  given  in  Fig.  VII-22  in  which  the  amplitude  of  the  detected 
signal  (in  arbitrary  units)  is  plotted  against  wavelength.  It  is  stressed,  however,  that 
these  results  are  of  a  preliminary  nature.  The  bottom  curve  is  the  spectrum  obtained 
in  the  absence  of  plasma  and  represents  the  effect  of  radiation  scattered  from  the  walls 
of  the  vacuum  chamber,  and  so  forth,  breaking  through  the  filter.  The  middle  curve  shows 
the  detector  output  when  the  laser  is  fired  into  the  plasma,  and  the  top  curve  represents 
the  difference  between  these  two  curves  on  an  expanded  scale. 

The shape of the top curve shows two peculiar effects:

(a) the peak centered at 6805 Å, and

(b) the curve does not go to zero in the wavelength range scanned.

It is difficult to attribute either of these effects to Thomson scattering from the
plasma electrons, and it is tentatively suggested that these effects arise from the
emission of radiation from the plasma as a result of the laser input. The large peak at
6805 Å is perhaps due to Raman scattering (the vibrational line being broadened by
rotational lines).

Fig. VII-23. Results of series 3 experiment.

We have some evidence to show that parts of the plasma emit radiation of fairly
uniform intensity over a wide range of wavelengths ~0.2 msec after the laser pulse is
turned on; this phenomenon suggests the idea of optical pumping of some levels in
excited states of A, A*, or A⁺, which then reradiate. If this unresolved effect of
amplitude X is subtracted from the signal amplitude near the laser line, that is, we assume
that there are no electrons present with energies greater than 10 ev, and plot signal
amplitude versus (Δλ)², we obtain the result shown in Fig. VII-23. The value of electron
temperature obtained from the gradient of this line is 1.2 ev.
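The step from the gradient of the line to an electron temperature can be sketched as follows. This is only an illustration under stated assumptions: a Maxwellian electron distribution, incoherent (non-collective) scattering, and a plot of the logarithm of the signal against (Δλ)². The scattering angle is not given in this section, so `theta_deg` is a free parameter, and the 6943 Å laser wavelength and 90° angle used in the example are hypothetical values chosen for illustration.

```python
import math

ELECTRON_REST_ENERGY_EV = 510_998.95  # m_e c^2 in eV


def width_squared(te_ev, lambda0, theta_deg):
    """Square of the 1/e half-width of the scattered line (units of lambda0 squared).

    Assumes Maxwellian electrons and incoherent scattering, for which the
    line shape is exp(-(delta_lambda)^2 / width_squared).
    """
    s = math.sin(math.radians(theta_deg) / 2.0)
    return 8.0 * te_ev * (lambda0 * s) ** 2 / ELECTRON_REST_ENERGY_EV


def temperature_from_slope(slope, lambda0, theta_deg):
    """Electron temperature (eV) from the slope of ln(signal) vs (delta_lambda)^2."""
    if slope >= 0.0:
        raise ValueError("a Gaussian line profile requires a negative slope")
    s = math.sin(math.radians(theta_deg) / 2.0)
    return ELECTRON_REST_ENERGY_EV * (-1.0 / slope) / (8.0 * (lambda0 * s) ** 2)


# Hypothetical illustration: 6943-angstrom laser line, 90-degree scattering.
w2 = width_squared(1.2, 6943.0, 90.0)
print(f"1/e half-width for 1.2 eV: {math.sqrt(w2):.1f} angstroms")
print(f"recovered temperature: {temperature_from_slope(-1.0 / w2, 6943.0, 90.0):.2f} eV")
```

Under these assumptions a steeper line corresponds to a colder electron population, since the slope equals minus the reciprocal of the squared line width.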

E.  Thompson,  G.  Fiocco 
H. USE OF FISSILE NUCLIDES IN FUSION REACTOR BLANKETS

The preliminary investigation of this problem has been completed and a final report,
in the form of an S.M. thesis to be submitted to the Department of Nuclear Engineering,
M.I.T., is being prepared.¹

The author's conclusions, based on tritium regeneration and heating-rate calculations,
regarding the feasibility and the practicality of the three blanket configurations
described previously²,³ are as follows.

(a) The metallic thorium first-wall blanket assemblies are rejected on the ground
that the thermal stress in the first wall limits the reactor operation to impractically
low power levels.

(b) The blanket systems with depleted UF₄ fused salts in the first-wall coolant
region are restricted to low-power operation (by heat transfer limitations arising from
the liberation of from 77 per cent to 135 per cent of the incident neutron energy flux
in the narrow coolant region).

(c) The blanket configurations containing a high UF₄ constituent (17-27 mole fraction
percentage) fused salt in the primary attenuator with approximately 50 per cent Li⁶
enrichment appear to be feasible and practical. The calculated tritium breeding ratios
are in excess of 1.2; thus a self-sustaining tritium cycle is ensured. The heat-generation
rates in the Mo first wall and in the coolant region are comparable to the nonfissile
systems studied by Impink and Homeyer⁴,⁵; therefore operation at the same incident energy

flux  of  from  4  to  5  is  possible.  The  fast  fission  reaction  doubles  the  recoverable 
in 

energy  liberated  in  the  blanket,  and  approximately  90  per  cent  of  the  heat  is  generated 
in  the  large  primary  attenuator  region.  The  manufacture  of  one  plutonium  239  atom  per 
five  fusion  reactions  may  have  economic  implications  in  terms  of  fission  reactor  fuel. 

L.  N.  Lontai 
I. FUSION REACTOR BLANKET EXPERIMENT
1. Introduction

Experiments have been initiated to study the neutronics of models of a
tritium-regenerating blanket to surround a power-producing deuterium-tritium cycle
thermonuclear reactor.¹⁻³ The objects of the experiments are (a) to check calculational
methods used to determine neutron fluxes and tritium production rates, and (b) to provide
a versatile means of studying effects of changes of materials and geometry on neutron
flux and tritium production.

Fig. VII-24. Mock-up of a fusion reactor blanket.

The blanket²⁻⁵ is envisioned as a series of annular regions surrounding a cylindrical
plasma, which contain materials for neutron multiplication and moderation, for removal
of heat generated in the plasma and in the blanket, and for tritium production by the
Li⁶(n,α)H³ and Li⁷(n,nα)H³ reactions. In the experiments, a core is removed from this
blanket, and the resulting cylinder is irradiated with a point source of 14-Mev neutrons.
A schematic diagram of the experimental arrangement is shown in Fig. VII-24.

2.  Neutron  Source 

Fig. VII-25. Gas target.

The 14-Mev neutrons for the experiment are supplied by a tritium gas target, shown
in Fig. VII-25. The tritium is contained in a stainless-steel tube 1/4 inch ID and 1.3
inches long. The target window, a disc of commercial aluminum foil 33.8 μ thick, is
held to the polished surface on a flange on the open end of the cylinder. A copper washer
seals the window to the cylinder; the other side of the washer forms a seal with the
target holder. Preliminary tests indicate that the window can withstand a 6-μa beam of
2.7-Mev deuterons almost indefinitely. The incident 2.7-Mev beam of deuterons is
slowed to 400 kev in the window, whence it enters the gas chamber, where it is slowed

U 

to  40  kev.  The  calculated  yield  is  9  X  10  neutrons  per  second  per  microampere. 

The tritium is supplied by a uranium tritide furnace similar to that described by
Johnson and Banta.⁶

The 2.7-Mev deuterons are supplied by the Van de Graaff accelerator at the Air
Force Cambridge Research Laboratory, Bedford, Massachusetts.

3.  The  Blanket  Mock-Up 

Most of the materials for the blanket, such as graphite for neutron-moderating
regions, are available in sufficient quantities for the mock-up assembly. Approximately
300 lb of lithium-beryllium fluoride (composition 2LiF·BeF₂) to simulate the fused-salt
regions have been supplied by the Reactor Chemistry Division of the Oak Ridge National
Laboratory. The salt is sealed in 12 aluminum pans 18 inches in diameter and 1 inch
thick.

4.  Tritium  Counting 

The tritium produced in the blanket assembly by the two lithium reactions is trapped
in the crystal lattice of the lithium-beryllium fluoride. Salt samples will be melted in
copper tube furnaces; the tritium thus evolved is swept into a proportional counter, by
using a Toepler pump to complete the transfer. A mixture of 10 per cent methane and
90 per cent argon is used as both the sweeping gas and the counting gas. After counting
the tritium activity (<10⁵ dpm) the gases in the proportional counter are exhausted
outside the laboratory. Since tritium will build up in the bulk of the salt in the assembly
from one experiment to another, fresh samples wrapped in aluminum foil will be used
for counting in each experiment. By this method, the tritium production rate can be
measured throughout the salt-containing regions of the assembly. The tritium-measuring
system has been checked with salt samples irradiated by thermal neutrons in the M.I.T.
reactor.

5. Threshold Detectors
a. Selection

The neutron flux in the blanket mock-ups will be measured as a function of position
in the blanket and of the neutron energy by means of threshold detectors. After a search
among publications five suitable threshold reactions were found: U²³⁸(n,f), P³¹(n,p)Si³¹,
Fe⁵⁶(n,p)Mn⁵⁶, I¹²⁷(n,2n)I¹²⁶, and F¹⁹(n,2n)F¹⁸. Details of these detectors are given
in Table VII-1. Several factors had to be considered in choosing the reactions to be used.

Table VII-1. Threshold detector properties.

Half-Life.  Since  it  will  require  about  one  hour  after  a  run  with  the  accelerator  to 
begin  counting  the  foils,  the  half-life  of  the  daughter  products  of  the  reactions  that  we 
use  cannot  be  much  less  than  1  hour,  or  the  activity  produced  will  have  decayed  before 
it  can  be  counted.  On  the  other  hand,  if  the  half-life  is  much  longer  than  several  hours, 
not  enough  activity  will  be  produced,  unless  the  cross  section  for  the  reaction  is  very 
large.  Also,  the  half-life  of  the  daughters  of  the  main  reactions  must  be  much  longer 
or  much  shorter  than  those  of  competing  reactions,  so  that  either  the  competing  activity 
can  be  allowed  to  decay  before  counting,  or  else  the  competing  activity  will  be  small. 
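The half-life trade-off described above can be illustrated with the standard activation build-up and decay factors. The sketch below is not taken from the report: the 2-hour irradiation time and the three trial half-lives are hypothetical values, and only the 1-hour delay before counting comes from the text.

```python
import math


def counting_fraction(half_life_h, irradiation_h, wait_h):
    """Fraction of the saturation activity still present when counting starts.

    Build-up during irradiation: 1 - exp(-lambda * t_irr)
    Decay during the wait:       exp(-lambda * t_wait)
    """
    lam = math.log(2.0) / half_life_h
    buildup = 1.0 - math.exp(-lam * irradiation_h)
    survival = math.exp(-lam * wait_h)
    return buildup * survival


# Hypothetical run: 2 h of irradiation, then the 1 h delay quoted in the text.
for half_life in (0.1, 2.5, 100.0):
    frac = counting_fraction(half_life, irradiation_h=2.0, wait_h=1.0)
    print(f"half-life {half_life:6.1f} h -> fraction of saturation {frac:.4f}")
```

A very short half-life loses its activity during the wait, a very long one never builds up appreciably, and a half-life of a few hours retains the largest fraction, which is the behavior the selection criterion relies on.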

Cross Section. The cross section for the reaction should be an accurately known
function of energy in the region <16.5 Mev. This requirement eliminates most of the
prospective reactions because (a) work with threshold detectors has usually been done
for reactor systems, for which the highest energy of interest is ~2 Mev, and (b) a great
deal of the data on threshold detectors results from measuring an effective cross section
for a neutron energy spectrum that is almost a fission spectrum. From calculations
made by Impink,⁷ we conclude that the spectrum in the blanket assembly will be nothing
like a fission spectrum; hence data on effective cross sections are of no value in the
present work.

Fig. VII-26. Cross sections for threshold detector reactions. The curves shown are
the best ones from available data (see Table VII-1). The curve for P³¹(n,p)
has been averaged over the resonances in the 2-5-Mev region; beyond
14 Mev, it was extrapolated on a logarithmic plot.

The cross sections for the reactions used are shown in Fig. VII-26. In general, the
cross sections are those of BNL-325⁸ modified by later measurements. Except for the
reaction F¹⁹(n,2n)F¹⁸, the cross sections measured by various authors are in good
agreement. In the case of the fluorine reaction there are two widely varying sets of
measurements of σ(E), one by French researchers,¹⁷ and one by Russians.¹⁸ The Russian
data are rather carefully documented, but the French values are simply presented as
preliminary results; furthermore, the Russian data at 14 Mev are in better agreement
with the work of others who have measured the cross section only at this energy. Hence
the Russian data are used in this work.

For use as a threshold detector the cross section for the P³¹(n,p)Si³¹ reaction
has been averaged over the many narrow resonances in the 2-5 Mev region. The
cross-section curve for this reaction is extrapolated logarithmically above 14 Mev, because
of the lack of measurements in this region. The extrapolation seems reasonable from
other (n,p) reactions that were studied.

Foil Material. To ease the problem of counting the foils after a given run,
the parent isotope must be in a form that does not require special handling
procedures. If a compound is to be used, the additional elements cannot contribute
too much to the background. For uranium and iron, metal foils can be
used to satisfy this requirement. The carbon in the Teflon that is used for the
fluorine does not interfere, since the threshold energy for the C¹²(n,2n) reaction
(20 Mev) is above the maximum neutron energy that will be encountered in
these experiments.

Although most authors suggest using ammonium phosphate as a phosphorus-containing
foil, we find that calcium metaphosphate (Ca(PO₃)₂) is a better material.²¹
The foils were manufactured as follows: Monobasic calcium phosphate
powder (Ca(H₂PO₄)₂·H₂O) is heated in graphite crucibles to 2300°F, at which
temperature it dehydrates, decomposes to Ca(PO₃)₂, and then melts. The
resulting clear syrupy fluid is poured into graphite molds (which are heated
by a hot plate), and pressed into shape before the molten fluid solidifies. The
resulting clear glass pellets (3/4 inch in diameter, 1/8 inch thick) are annealed
at 850°F for several hours to remove the thermal strains introduced in
casting, and then slowly brought to room temperature. The pellets can then
be handled as ordinary glass, although they are not quite so brittle. They are
of approximately 90 per cent theoretical density, because of air bubbles and
bits of graphite trapped during the casting process.

For the I¹²⁷(n,2n)I¹²⁶ reaction, lead iodide appears to be a suitable material.
Hexaiodobenzene is also being investigated.
b. Analysis of Data

The analysis of the threshold foil data is based on an orthonormal polynomial
expansion method of Lanning and Brown.²³ A FORTRAN II program has been
written to perform the data analysis, and is now being tested with reaction rates
calculated by Impink.⁷

P.  S.  Spangler 
A. THERMIONIC EMISSION FROM MONOCRYSTALLINE TUNGSTEN SURFACES
IN THE PRESENCE OF CESIUM


1. Introduction

The preliminary results of a recently initiated thermionic investigation are reported
here, together with a brief description of our apparatus. The primary purpose of this
study is to attain a better fundamental understanding of the emission processes that occur
at the electrodes of thermionic energy converters.

A well-defined emitter surface is necessary in order to obtain experimental results
of the quality that is most useful in the formulation of a general theory of thermionic
processes. Monocrystalline surfaces are generally accepted as being the best specimens
for such studies, and many investigators have employed them with varying degrees of
success. Two of the most significant experiments are those of Nichols¹ and Smith,²
the results of which show that the work function of clean tungsten depends markedly on
the crystallographic structure of the particular surface that is being studied. The
close-packed surfaces generally have the higher work functions, the approximate range for
tungsten being from 4.65 ev down to 4.30 ev.
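The practical weight of a 0.35-ev spread in work function can be estimated from the exponential factor of the Richardson-Dushman emission law. The sketch below is illustrative only: it assumes the same pre-exponential constant for both crystal faces, and it borrows the 1610°K filament temperature quoted later in this report.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV per kelvin


def emission_ratio(phi_low_ev, phi_high_ev, temperature_k):
    """Ratio of saturation current densities for two work functions.

    Uses J proportional to T^2 * exp(-phi / kT) and assumes equal
    Richardson constants, so only the exponential factor survives.
    """
    return math.exp((phi_high_ev - phi_low_ev) / (BOLTZMANN_EV_PER_K * temperature_k))


ratio = emission_ratio(4.30, 4.65, 1610.0)
print(f"emission ratio between the 4.30-ev and 4.65-ev surfaces: {ratio:.1f}")
```

Under these assumptions the lowest-work-function surface emits roughly an order of magnitude more current than the highest, which is why the emission pattern resolves the crystallographic directions so clearly.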

Although much work has been devoted to the study of clean monocrystalline surfaces,
there have been very few quantitative investigations of monocrystalline surfaces that
are partially covered with a layer of adsorbed molecules, a problem of current
importance to thermionic energy conversion and ion propulsion. Qualitative observations
have been made by a number of investigators,³,⁴ and Webster⁵ has obtained significant
quantitative results for a few of the most conspicuous crystallographic directions of
several refractory metals. The results show that cesium affects the close-packed planes
most strongly, causing their work functions to be reduced by as much as 3 ev.

There  is  a  definite  need  for  additional  research  in  this  area  since  the  existing 
theories  are  incomplete,  and  the  experimental  results  are  insufficient.  For  this  reason, 
the  present  investigation  has  been  designed  to  study  in  detail  the  dependence  of  the 
thermionic  properties  of  tungsten  on  crystallographic  direction,  surface  temperature, 
and  cesium  density. 

2. Apparatus

An experimental tube built by Salomon⁶ was modified for the present investigation.
The design is basically a simplification of that used by Nichols.¹ The test specimen
is a tungsten filament having a diameter of 0.0049 inch and a length of 14 cm. A long
crystal has been grown in this filament by means of a special heating schedule.⁴ Crystals
formed in this way are usually oriented so that the (110) crystallographic
direction is parallel to the axis of the wire; this means that all crystallographic
directions that are normal to the surface of the filament will have Miller indices of the
form (hhk). Martin³ has shown that this class of directions includes those of maximum
and minimum emission; therefore it is most desirable for thermionic studies.
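The statement about the (hhk) form can be checked by enumerating the low-index directions perpendicular to the wire axis in a cubic crystal, where perpendicularity is a zero dot product of the index triples. In this sketch the axis is taken as [1 -1 0], one member of the 110 family; that particular choice, the function name, and the index cutoff are assumptions made for illustration.

```python
from itertools import product


def normal_directions(axis, max_index=2):
    """Low-index directions perpendicular to `axis` in a cubic crystal."""
    found = []
    for h, k, l in product(range(-max_index, max_index + 1), repeat=3):
        if (h, k, l) == (0, 0, 0):
            continue
        # In a cubic lattice two directions are perpendicular when the
        # dot product of their index triples vanishes.
        if h * axis[0] + k * axis[1] + l * axis[2] == 0:
            found.append((h, k, l))
    return found


directions = normal_directions((1, -1, 0))
print(all(h == k for h, k, _ in directions))
print([d for d in directions if min(d) >= 0])
```

Every direction found has equal first and second indices, and the set includes (110), (001), (111), and (112), so rotating the slit around the wire sweeps through these principal directions.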

The filament is surrounded by a three-piece cylindrical anode (Fig. VIII-1). The
two endpieces are fixed and serve as guard rings, while the center section is rotatable
(by means of magnets) and contains a small slit cut parallel to the axis of the filament.
The diameter of the center section is 1.9 cm, and the slit dimensions are 0.07 cm by
1.01 cm, so that the subtended angle is approximately 4.23°. By applying a strong
electric field between the filament and the three-piece anode, one can cause the electrons
emitted from the surface to be accelerated radially in straight-line paths. The slit acts
as a microscope focused on a small area of the emitting surface, thereby allowing only
the electrons from that area to pass through the slit to be measured at the collector.
(The approximate value of this area for the present tube is 4.65 × 10⁻⁴ cm² if the
filament-anode concentricity is perfect.) It is therefore possible, at least in principle,
to measure the emission associated with the different crystallographic directions by
rotating the slit to the appropriate positions.
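The quoted angle and viewed area follow from the tube dimensions if the electrons travel along radial straight lines. The sketch below repeats that arithmetic; the small-angle relation (slit width divided by anode radius) and the function names are assumptions of this illustration, not part of the original analysis.

```python
import math

INCH_TO_CM = 2.54

filament_diameter_cm = 0.0049 * INCH_TO_CM
anode_diameter_cm = 1.9
slit_width_cm = 0.07
slit_length_cm = 1.01


def slit_angle_rad(slit_width, anode_diameter):
    """Angle subtended by the slit at the filament axis (small-angle form)."""
    return slit_width / (anode_diameter / 2.0)


def viewed_area_cm2(angle_rad, filament_diameter, slit_length):
    """Filament surface area whose radial electrons reach the slit."""
    arc_on_filament = angle_rad * filament_diameter / 2.0
    return arc_on_filament * slit_length


angle = slit_angle_rad(slit_width_cm, anode_diameter_cm)
area = viewed_area_cm2(angle, filament_diameter_cm, slit_length_cm)
print(f"subtended angle: {math.degrees(angle):.2f} degrees")
print(f"viewed area: {area:.2e} cm^2")
```

Both results agree with the values given in the text to within about one per cent, the residual presumably reflecting rounding in the original figures.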

A schematic of the electrical circuit is included in Fig. VIII-1. A dc power supply
is used to maintain the three-piece anode at a potential of +1000 volts with respect to
the filament. The cylindrical collector is held at a potential of only 10 volts above that
of the filament to reduce the undesirable effects of secondary emission. The collector
current is measured with a suitable electrometer.

A cesium reservoir is connected to the experimental tube, and its temperature is
regulated in order to attain the desired density of the cesium vapor. The tungsten
filament is heated directly by ac current, and its temperature is determined simply from
the resistivity, a method that is of questionable accuracy at temperatures less than
1300°K unless the appropriate calibrations and corrections are made.¹

Fig. VIII-1. Experimental apparatus.

3.  Results 

The tube was evacuated by means of a mercury diffusion-pump system, and a pressure
of approximately 10⁻¹⁰ mm Hg was attained after thorough out-gassing. The filament
was flashed a number of times until the emission pattern appeared to be stable.
The filament temperature was then set at 1610°K, and readings of the collector current
were taken for various slit positions. A portion of the data is shown in Fig. VIII-2, and
its general pattern is similar to those reported by Nichols¹ and Smith.² Therefore, by
referring to their work it was possible to estimate the positions of the prominent
crystallographic directions along the abscissa. The abscissa actually represents the angular
position of the slit; the pattern in Fig. VIII-2 does not cover the full 360° of azimuth,
as its scope was limited to approximately 210°.
Fig. VIII-2. Thermionic emission vs crystallographic direction for cesiated and
bare tungsten. Abscissa: crystallographic direction (estimated).


The tube was then sealed off from the vacuum system, and cesium was introduced by
the usual method, that is, breaking a cesium-filled glass capsule in the reservoir. With
the cesium reservoir at room temperature (26°C), the filament temperature was so
adjusted as to obtain maximum emission in the (110) direction. (This temperature is
not known accurately because a detailed calibration has not yet been performed; the
estimated value is 700°K.) The resulting emission data for various slit positions are
included in Fig. VIII-2 for comparison with the previous data obtained for bare tungsten.
Since these are preliminary results, a detailed analysis is not warranted at this time.
However, it is interesting to note that:

(i) The presence of cesium causes the maximum emission at 700°K to be as much
as two orders of magnitude greater than that for the bare surface at 1610°K.

(ii) Surfaces having the lowest work function in the presence of cesium are those
that have the highest "bare" work function.

Both of these general observations are in agreement with the results obtained by
other investigators.³,⁵ On the basis of these preliminary results, we conclude that the
present technique may be employed now in a more detailed study of emission processes.

4.  Future  Plans 

The filament failed before more reproducible data could be obtained. A new filament
is being prepared, and modifications have been made to reduce the effects of stray
currents caused by leakage, secondary electrons, and photoemission. The immediate plan
is to obtain data for other conditions of emitter temperature and cesium density.

The  close  cooperation  of  the  members  of  the  Physical  Electronics  Group  of  the 
Research  Laboratory  of  Electronics  has  been  invaluable  throughout  this  program. 

R.  E.  Stickney 
B. A-C POWER GENERATION WITH MAGNETOHYDRODYNAMIC CONDUCTION
MACHINES

Magnetohydrodynamic (MHD) power generation is attractive because of the inherent
simplicity of the system. When an ionized gas is used as the working fluid,
higher-temperature operation is possible with an MHD generator. This makes heat rejection
easier in space applications and increases thermal efficiency in ground-based installations.

In many applications, especially those in which the power is to be processed, it is
desirable to generate ac power. Direct generation of ac power in MHD machines with
ionized gases used as the working fluids has been studied extensively.³ The several
theoretical possibilities proposed thus far have been shown to be impractical for
operation with conditions that are characteristic of combustion temperatures, either because
the electrical conductivity of the gas is too low or because too much external capacitance
is necessary to provide the reactive power to excite the magnetic field.¹

In this report we describe a new type of ac MHD power generator. It is a conduction
machine, and all reactive power is handled by inductive energy storages. Moreover,
the criterion for successful operation as a self-excited ac machine is simply that the
machine be suitable for self-excitation as a dc machine.

Although the system is most promising for application to gaseous MHD systems, the
fundamental mechanism of operation will be demonstrated theoretically with an
incompressible fluid model. This is standard practice with many types of MHD conversion
devices. After the system has been analyzed with scalar conductivity assumed, a
configuration will be presented which is suitable for gases exhibiting a large Hall effect.
Finally, some estimates of performance will be made for gas properties that are
characteristic of temperatures at which gases can be contained by material walls.

1. System with Scalar Conductivity

Consider the system illustrated in Fig. VIII-3. It consists of a channel of constant
cross-section area having the dimensions shown. An incompressible fluid flows with
the velocity v in the direction shown. There are two pairs of electrodes feeding, through
two sets of coils, two identical load resistances R_L. The generators formed by the two
pairs of electrodes are assumed to be independent insofar as the fluid is concerned. That
is, end and edge effects are neglected and the two circuits are electrically insulated
from each other. Each of the two generators has two magnet coils, one carrying
current i_1 and the other carrying current i_2 in the directions indicated. In practice, each
coil would have a second half under the channel with the two connected series-aiding.
Only one-half of each coil is shown in Fig. VIII-3 for simplicity. Although only two sets
of electrodes and coils are shown, the system can have any number of sets. Furthermore,
the sets can be connected for two-phase operation as in Fig. VIII-3 or for operation with
any number of phases. In the last case, the number of coils associated with each pair
of electrodes must be increased, but the essential features of ac operation are the same
as those described below.

Fig. VIII-3. Basic configuration with solid electrodes.

Denoting the magnetic flux densities B_1 and B_2 in the regions of the two electrode
pairs, assuming the positive directions shown in Fig. VIII-3, assuming the coils to be
distributed so that the flux densities are essentially uniform over the length ℓ, and assuming
that the coils produce a field just as a long solenoid (or assuming infinitely permeable
pole pieces), we can write

$$ B_1 = \frac{\mu_0 N}{d}\, i_1 - \frac{\mu_0 N_m}{d}\, i_2 \qquad (1) $$

$$ B_2 = \frac{\mu_0 N}{d}\, i_2 + \frac{\mu_0 N_m}{d}\, i_1. \qquad (2) $$


If we define the internal resistance for one pair of electrodes as

$$ R_i = \frac{w}{\sigma \ell d}, \qquad (3) $$

where σ is the electrical conductivity of the fluid, we can write the equations for the
two circuits as

$$ v w B_1 = R i_1 + N \ell w \frac{dB_1}{dt} + N_m \ell w \frac{dB_2}{dt} \qquad (4) $$

$$ v w B_2 = R i_2 + N \ell w \frac{dB_2}{dt} - N_m \ell w \frac{dB_1}{dt}. \qquad (5) $$

Here, R is the total series resistance in one circuit (R = R_L + R_i + R_c, where R_c is
the resistance of the magnet coils).

Substituting Eqs. 1 and 2 in Eqs. 4 and 5 and simplifying yields

$$ G i_1 - G_m i_2 = R i_1 + L \frac{di_1}{dt} \qquad (6) $$

$$ G i_2 + G_m i_1 = R i_2 + L \frac{di_2}{dt}, \qquad (7) $$

in which the parameters G, G_m, and L are defined as

$$ G = \frac{v w \mu_0 N}{d} \qquad (8) $$

$$ G_m = \frac{v w \mu_0 N_m}{d} \qquad (9) $$

$$ L = \frac{\mu_0 \ell w}{d}\left(N^2 + N_m^2\right). \qquad (10) $$

Elimination of i_2 from Eqs. 6 and 7, after some simplification, yields

$$ \frac{d^2 i_1}{dt^2} + \frac{2(R-G)}{L}\,\frac{di_1}{dt} + \frac{G_m^2 + (R-G)^2}{L^2}\, i_1 = 0. \qquad (11) $$

This equation has solutions of the form

$$ i_1 = I_0 \exp\left(-\frac{R-G}{L}\, t\right) \cos\left(\frac{G_m}{L}\, t + \theta\right), \qquad (12) $$

where I_0 and θ are constants to be evaluated from initial conditions. It is clear from
the form of this solution that when R > G the response is a damped sinusoid, when R < G
the response is an exponentially growing sinusoid, and when

$$ R = G \qquad (13) $$

the response is a sinusoid of constant amplitude. Reference to Eqs. 6 and 7 shows that
Eq. 13 is the condition for self-excited operation as a dc generator.
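The three regimes can be read directly from the roots of the characteristic polynomial of Eq. 11. The sketch below computes those roots with the quadratic formula; the function names and the numerical parameter values are hypothetical and serve only to exercise the damped, growing, and constant-amplitude cases.

```python
import cmath


def characteristic_roots(R, G, Gm, L):
    """Roots of s^2 + [2(R-G)/L] s + [Gm^2 + (R-G)^2]/L^2 = 0 (Eq. 11)."""
    b = 2.0 * (R - G) / L
    c = (Gm ** 2 + (R - G) ** 2) / L ** 2
    disc = cmath.sqrt(b * b - 4.0 * c)
    return (-b + disc) / 2.0, (-b - disc) / 2.0


def classify(R, G, Gm, L, tol=1e-12):
    """Name the response from the real part of the roots."""
    growth_rate = characteristic_roots(R, G, Gm, L)[0].real
    if abs(growth_rate) < tol:
        return "steady"
    return "growing" if growth_rate > 0.0 else "damped"


# Hypothetical values: ohms for R, G, Gm and henries for L.
for R, G in ((1.2, 1.0), (1.0, 1.2), (1.0, 1.0)):
    s1, _ = characteristic_roots(R, G, Gm=0.5, L=0.01)
    print(R, G, classify(R, G, 0.5, 0.01), f"rate={s1.real:.1f}", f"omega={abs(s1.imag):.1f}")
```

The real part of each root is -(R-G)/L and the imaginary part is ±G_m/L, so the oscillation frequency is set by the coupling turns alone while the sign of R-G decides growth or decay.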

Henceforth we shall assume that Eq. 13 is satisfied, in which case Eqs. 6 and 7 can
be written

$$ -G_m i_2 = L \frac{di_1}{dt} \qquad (14) $$

$$ G_m i_1 = L \frac{di_2}{dt}. \qquad (15) $$

These equations, used with Eq. 12, show that i_1 and i_2 form a two-phase set. That is,
if

$$ i_1 = I_0 \cos(\omega_m t + \theta), \qquad (16) $$

Eq. 14 shows that

$$ i_2 = I_0 \sin(\omega_m t + \theta). \qquad (17) $$

With the system parameters adjusted to satisfy Eq. 13, the system operates in the
steady state with frequency

$$ \omega_m = \frac{G_m}{L} \qquad (18) $$

and with currents of finite amplitude. Consequently, power is dissipated in R_L and
useful ac power is generated.

Equation 13 is the condition that must be satisfied for self-excitation as a dc
generator with N turns on the field winding, and a coil resistance R_c in series with the load.
The addition of the coupling windings N_m has transformed this pair of self-excited dc
generators into a two-phase, self-excited ac generator. The price for making the change
to ac operation is the addition of the N_m-turn coils that occupy space and contribute to
resistance R_c. Reactive power is handled completely by the two magnetic storages
having flux densities B_1 and B_2. Note that the two currents have 90° phase difference.
The use of these currents in Eqs. 1 and 2 shows that B_1 and B_2 also have 90° phase
difference. Thus the magnetic energy is constant in magnitude and oscillates between
the two magnetic-field storages.
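The constancy of the stored energy can be made explicit by inserting the two-phase currents of Eqs. 16 and 17 into Eqs. 1 and 2. The short derivation below is a sketch: φ stands for the common phase angle of the two currents, and the stored energy W is evaluated on the assumption that each field uniformly fills a volume ℓwd, which is an idealization introduced here.

```latex
\begin{aligned}
B_1 &= \frac{\mu_0 I_0}{d}\left(N\cos\phi - N_m\sin\phi\right), \qquad
B_2 = \frac{\mu_0 I_0}{d}\left(N\sin\phi + N_m\cos\phi\right), \\
B_1^2 + B_2^2 &= \left(\frac{\mu_0 I_0}{d}\right)^2 \left(N^2 + N_m^2\right), \\
W &= \frac{B_1^2 + B_2^2}{2\mu_0}\,\ell w d
   = \frac{\mu_0 \ell w}{2d}\left(N^2 + N_m^2\right) I_0^2
   = \tfrac{1}{2} L I_0^2 .
\end{aligned}
```

The cross terms cancel, so the sum of the two field energies does not depend on φ; the total equals the energy that the inductance L of Eq. 10 stores at the peak current I_0.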


2.  Limiting  Parameters 


Assuming steady-state ac operation, we can use the one-dimensional model in the
analysis above to identify some limiting parameters of the system.

The use of Eqs. 9 and 10 in Eq. 18 yields

$$ \omega_m = \frac{v N_m}{\ell \left(N^2 + N_m^2\right)}. \qquad (19) $$

It will be desirable to operate such a machine at as high a frequency as possible. With
fixed v and ℓ, Eq. 19 has a maximum value when N_m = N (we assume here, of course,
that a minimum of one turn is possible). Thus, we can define the maximum frequency
of operation as

$$ \Omega = \frac{v}{2 \ell N} \qquad (20) $$

with the constraint that N = N_m.

Next, we can assume that R_L + R_c is some multiple of R_i; thus

$$ R_L + R_c = \alpha R_i. \qquad (21) $$

Then, using the definition of R_i (Eq. 3) and the condition for self-excitation (Eq. 13), we
obtain

$$ N = \frac{1 + \alpha}{\mu_0 \sigma v \ell}. \qquad (22) $$

Note that the denominator of this expression is a magnetic Reynolds number based on
the length of a pair of electrodes.

Substitution of Eq. 22 in Eq. 20 yields

$$ \Omega = \frac{\mu_0 \sigma v^2}{2(1 + \alpha)}. \qquad (23) $$

This result shows that the maximum frequency of operation depends on the conductivity
σ, velocity v, and degree of loading α, and is independent of the size of the device.

To  obtain  some  idea  of  the  conditions  that  must  be  satisfied  with  an  ionized  gas  for 
ac  power  generation  of  this  type,  we  can  use  Eq.  23  to  calculate  values.  Assuming  a 
frequency  of  60  cps,  we  can  rewrite  Eq.  23  as 


(TV 

1  +  a 


=  6X10 


8 


Values  of  a,  <r,  and  v  that  satisfy  this  equation  are  given  in  Table  V1II-I. 


Table  VIII-I.  Velocity,  conductivity,  and  loading  factor  for  operation  at  60  cps. 


                 Conductivity σ (mhos/meter) for loading factor α
v (m/sec)        α = 0       α = 0.5     α = 1.0     α = 2.0

1000             600         900         1200        1800
1500             270         400         530         800
2000             150         230         300         450
2500             96          144         190         290
3000             67          100         130         200
3500             49          74          100         150
4000             37          55          75          110

It is apparent from the numbers in Table VIII-I that there is a chance of using this
type of generator with combustion gases and with gases having increased conductivity
that is due to nonequilibrium effects. At a given velocity and loading factor the required
conductivity varies linearly with frequency. Thus the device will work better at
frequencies below 60 cps. For gases at combustion temperatures, high-frequency
operation (such as 400-1000 cps) does not appear possible because of the high velocities
and conductivities required.

The data of Table VIII-I can be used with an assumed length in Eq. 22 to find the
number of turns required. To show that the number of turns is reasonable, assume
$\ell = 1$ meter and use $\alpha = 1$, $v = 3000$ m/sec, $\sigma = 130$ mhos/meter from Table VIII-I to
find $N = 4$ turns; this value is certainly reasonable for constructing such a device. The
other conditions of Table VIII-I will yield numbers of turns having the same order of
magnitude.
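
As a numerical check of the limiting-parameter relations above, the following Python sketch evaluates Eq. 23 solved for the conductivity, Eq. 22 for the number of turns, and Eq. 19 for the frequency. The function names and the use of SI units with μ0 = 4π × 10⁻⁷ henry/meter are assumptions made for this illustration; they do not appear in the report.

```python
import math

MU_0 = 4.0e-7 * math.pi  # permeability of free space, H/m (SI units assumed)


def required_conductivity(freq_cps, v, alpha):
    """Conductivity (mhos/meter) from Eq. 23 solved for sigma."""
    omega = 2.0 * math.pi * freq_cps
    return 2.0 * (1.0 + alpha) * omega / (MU_0 * v ** 2)


def turns_for_self_excitation(sigma, v, length, alpha):
    """Number of turns N from Eq. 22."""
    return (1.0 + alpha) / (MU_0 * sigma * v * length)


def frequency(v, length, n_turns, n_m):
    """Angular frequency of steady-state operation from Eq. 19."""
    return v * n_m / (length * (n_turns ** 2 + n_m ** 2))


if __name__ == "__main__":
    # Rows comparable to Table VIII-I (the table entries are rounded).
    for v in (1000.0, 2000.0, 3000.0, 4000.0):
        row = [round(required_conductivity(60.0, v, a)) for a in (0.0, 0.5, 1.0, 2.0)]
        print(int(v), row)
    # Worked example from the text: length 1 m, alpha = 1, v = 3000, sigma = 130.
    print(turns_for_self_excitation(130.0, 3000.0, 1.0, 1.0))
```

The turns count comes out slightly above 4 for the worked example, which the text rounds to N = 4.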

3.  Segmented  Electrodes  and  Tensor  Conductivity 

It  appears  from  the  numbers  mentioned  in  the  preceding  section  that  the  number  of 
turns  per  field  coil  may  be  small.  For  a  large  generator,  the  field  conductors  in  such 
a  system  may  be  so  large  that  eddy  currents  become  a  problem.  Moreover,  if  the  gas 
has  tensor  conductivity,  the  solid  electrodes  will  short  out  the  Hall  electric  fields  and 
degrade  performance. 

Both of these deleterious effects can be reduced by using segmented electrodes. The
construction for two electrode segments per generator section is illustrated in
Fig. VIII-4. The essence of this arrangement is that each electrode pair is connected
to a circuit exactly like that for solid electrodes in Fig. VIII-3. The circuit for each
electrode pair is electrically insulated from all other circuits. If many electrode pairs
are used, the power will probably be extracted by inductive coupling to magnetic fields
$B_1$ and $B_2$, or by replacing $R_L$ with a transformer winding. In any case, the generation
of ac power with segmented electrodes does not create the problem of having many
insulated loads that a dc generator does.

Fig. VIII-4. Configuration for use with segmented electrodes.

For the analysis assume that there are $n$ electrode segments per generator (of
length $\ell$) and that the coils span a distance $\ell$ in the flow direction. If each pair of
electrodes is assumed to carry the same current as other segments in the same generator
section, then for the flux densities $B_1$ and $B_2$ (see Eqs. 1 and 2) we can write


$$B_1 = \frac{nN\mu_0}{d}\, i_1 - \frac{nN_m\mu_0}{d}\, i_2 \tag{24}$$

$$B_2 = \frac{nN\mu_0}{d}\, i_2 + \frac{nN_m\mu_0}{d}\, i_1. \tag{25}$$


Assuming a one-dimensional model, the internal resistance $R_i'$ associated with a pair of
electrodes (see Eq. 3) is

$$R_i' = nR_i = \frac{nw}{\ell d \sigma}. \tag{26}$$


The equations for single electrode pairs in the two generator sections (see Eqs. 4 and
5) are

$$vwB_1 = R'i_1 + N\ell w \frac{dB_1}{dt} + N_m \ell w \frac{dB_2}{dt} \tag{27}$$

$$vwB_2 = R'i_2 + N\ell w \frac{dB_2}{dt} - N_m \ell w \frac{dB_1}{dt}, \tag{28}$$


where $R' = R_L + R_i' + R_c$ is the total series resistance in one electrode-pair circuit.

Substituting Eqs. 24 and 25 in Eqs. 27 and 28 and simplifying (see Eqs. 6 and 7) yields

$$G_n i_1 - G_{mn} i_2 = R' i_1 + L_n \frac{di_1}{dt} \tag{29}$$

$$G_n i_2 + G_{mn} i_1 = R' i_2 + L_n \frac{di_2}{dt}. \tag{30}$$

The parameters $G_n$, $G_{mn}$, and $L_n$ are defined as

$$G_n = \frac{nN\mu_0 v w}{d} \tag{31}$$

$$G_{mn} = \frac{nN_m\mu_0 v w}{d} \tag{32}$$

$$L_n = \frac{n\mu_0 \ell w\,(N^2 + N_m^2)}{d}. \tag{33}$$


For this system self-excitation is achieved when the parameters are adjusted to
satisfy the condition

$$G_n = R'. \tag{34}$$

With this condition satisfied, the frequency of steady-state operation is

$$\omega = \frac{G_{mn}}{L_n}. \tag{35}$$

By using the definitions of Eqs. 32 and 33, the frequency can be expressed as

$$\omega = \frac{v N_m}{\ell\,(N^2 + N_m^2)}. \tag{36}$$

This is exactly the same as Eq. 19 for the system with solid electrodes.
The maximum frequency occurs when $N_m = N$, and is

$$\omega_m = \frac{v}{2\ell N}. \tag{37}$$

Defining the parameter $\alpha$ as

$$R_L + R_c = \alpha R_i', \tag{38}$$

we can use the self-excitation condition, Eq. 34, with Eq. 26 to obtain

$$N = \frac{1 + \alpha}{\mu_0 \sigma v \ell}. \tag{39}$$


Note that the results of Eqs. 36, 37, and 39 are independent of the number of
electrode segments and are the same as Eqs. 19, 20, and 22 for the system with solid
electrodes. Thus, all of the discussion of limiting parameters in the preceding section holds
also for the case of segmented electrodes. The only effect of segmenting the electrodes
is to require a number of windings that is equal to the number of electrodes, each winding
having the same number of turns as required by the machine with solid electrodes. Each
winding carries a fraction 1/n of the total current supplied by a generator section;
consequently, the volume of field conductor is independent of the number of electrode
segments.
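
The independence from the number of segments can be checked numerically. The sketch below assumes the circuit equations take the form $L_n\,di_1/dt = (G_n - R')i_1 - G_{mn}i_2$ and $L_n\,di_2/dt = (G_n - R')i_2 + G_{mn}i_1$, so that the natural frequencies are $(G_n - R')/L_n \pm j\,G_{mn}/L_n$. The function name, the default channel dimensions, and the SI value of μ0 are illustrative assumptions, not values from the report.

```python
import math

MU_0 = 4.0e-7 * math.pi  # permeability of free space, H/m (SI units assumed)


def natural_frequency(n, N, N_m, v, sigma, alpha, w=0.2, d=0.2, length=1.0):
    """Return (growth_rate, omega) for one electrode-pair circuit.

    n       number of electrode segments per generator section
    N, N_m  turns on the field coil and on the coupling coil
    w, d    channel dimensions in meters (hypothetical defaults)
    """
    G_n = n * N * MU_0 * v * w / d
    G_mn = n * N_m * MU_0 * v * w / d
    L_n = n * MU_0 * length * w * (N ** 2 + N_m ** 2) / d
    R_internal = n * w / (length * d * sigma)   # internal resistance per pair
    R_total = (1.0 + alpha) * R_internal        # load and coil resistance included
    growth_rate = (G_n - R_total) / L_n         # zero at the self-excitation condition
    omega = G_mn / L_n                          # steady-state angular frequency
    return growth_rate, omega


if __name__ == "__main__":
    # Turns chosen to satisfy self-excitation exactly (not rounded to an integer).
    N = 2.0 / (MU_0 * 130.0 * 3000.0 * 1.0)
    for n in (1, 2, 8):
        print(n, natural_frequency(n, N, N, 3000.0, 130.0, 1.0))
```

For every value of n the growth rate vanishes and the frequency equals v/(2ℓN), which is the content of the statement that segmentation leaves the limiting parameters unchanged.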


4.  Discussion 


The idealized analysis in this report indicates that generators of this type may be
applicable to systems in which ionized gases are used at temperatures
that are characteristic of combustion processes. Their advantage is that they generate
alternating current directly and use only magnetic fields for energy storage.

The theory assumes incompressible flow. The basic mechanism still occurs in
compressible flow, the velocity being the average flow velocity of the gas. There will be
compressibility effects that need to be evaluated, and they may even augment the
conversion process.

The generators analyzed here are two-phase machines. By a simple extension of the
concepts we can devise an n-phase machine by having n separate sets of electrodes and
n sets of magnet coils along the channel. A coupling coil is required in each set for
current from every other set of electrodes. No matter how many phases a machine may
have, it can be reduced to an equivalent two-phase machine for analysis. Thus the
present analysis is complete for a balanced n-phase machine.

H.  H.  Woodson 
C. ELECTROHYDRODYNAMIC WAVES IN ROTATIONAL SYSTEMS
1. Introduction

Efforts to achieve a stable magnetohydrodynamic containment have led some
investigators to consider the effect of rotation. From everyday experience, one would suspect
that rotational acceleration, just as the acceleration of gravity, could be responsible
for stabilizing or destabilizing a system, the choice depending on the configuration.
Common examples are the gravitational or Taylor instability, produced by gravity on a
top-heavy fluid interface,¹ or the stabilizing influence of gravity in a situation in which a
perpendicular electric field is responsible for a surface instability.²

However, it is well known that rotational motion is considerably more involved than
this. In addition to the centrifugal forces experienced in pure rotation, particles that
are in motion with respect to a rotating frame respond to a Coriolis acceleration, which
results in a mode splitting that has been well known since the last century in conjunction
with ordinary gravity waves,³ but has been studied in detail only recently.⁴

This report is a study of the three types of surface interactions in circular
cylindrical geometry. The surface not only has an equilibrium curvature, but is also subject
to accelerations associated with a steady rigid-body equilibrium rotation.

There are two reasons for wishing to study systems of this type. The great interest
in stabilizing MH systems in connection with the thermonuclear program has already
been mentioned. The antiduality of MH and EH systems, which has been pointed
out elsewhere,⁵ should serve to show that study of the EH-If system can lead to a
better understanding of the MH counterpart. An experimental investigation of the
EH problem is possible, whereas the MH system can be considered experimentally
only with great difficulty. In the second place, the study of cylindrical field-coupled
systems has always been of primary interest in spite of the fact that it is very
difficult to achieve, experimentally, an equilibrium cylindrical geometry that
approaches the idealization of a cylinder. For example, a jet in free fall changes
in radius radically over a short distance unless the velocity is made high enough
to produce turbulence. In fact, the very velocity of the jet is not a trivial addition
to the dynamics, as has been shown in dealing with the convective systems.⁶
In contrast, here we shall remove the effect of rotational convection by
considering all interactions, both theoretically and experimentally, in the rotating
frame.

Fig. VIII-5. Cross section of rotating structure showing dimensions.
2. Description of Problems

The mechanical configuration that will be considered here is shown in Fig. VIII-5.

The region between two coaxial rigid circular cylinders is filled with two inviscid,
incompressible, immiscible fluids in such a way that they form an equilibrium interface at a
radius $R$. The cross section is assumed to remain constant. The complete assembly
is then assumed to rotate with an equilibrium rotational velocity $\Omega$. It is possible to
produce such a configuration by filling a cylinder with two fluids and rotating the cylinder
fast enough to centrifugally drive the heavier fluid to the outside. The essentially rigid
body rotation that results depends on the centrifugal acceleration being large compared
with the acceleration of gravity. Hence, this last acceleration will be ignored in this
analysis.

Although it is not usual to view them as such, problems of this kind are characterized
by a discrete set of possible transverse modes so that they comprise a type of surface
resonator.⁷ This is most apparent if there is no axial dependence of the modes
considered. In the experiments described in this report, the technique used to verify the
theoretical considerations will take advantage of this fact. In any case, an experiment
must involve a finite structure so that the axial modes will also be selected from a
discrete set of wave numbers.

Here, both EH-I and EH-II interactions will be considered. The type EH-I
interaction is produced by an electric field that, in general, is in the radial direction. The
EH-Ip interaction is produced by equipotentials on the inner and outer cylinders because the region
between would be filled with dielectric. However, in the EH-If interaction either the
inner or outer cylinder can form one of the equipotentials, while the other might be
formed by the interface. For the EH-II interaction, the inner and outer cylinders must
be dielectrics, with the electric field imposed by parallel plates that are perpendicular
to the z-axis. The types of velocity and electric fields to be considered are given by


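$$\bar{V}' = V_1 \bar{a}_1 + (\Omega r + V_2)\,\bar{a}_2 + V_3 \bar{a}_3 \tag{1}$$

EH-I:   $\bar{E} = \left(\dfrac{v}{r} + e_1\right)\bar{a}_1 + e_2 \bar{a}_2 + e_3 \bar{a}_3$

EH-II:  $\bar{E} = e_1 \bar{a}_1 + e_2 \bar{a}_2 + (E_0 + e_3)\,\bar{a}_3$  (2)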

3. EH Equations of Motion

The EH bulk equations will be taken as

$$\nabla \cdot \bar{V}' = 0 \tag{3}$$

$$\rho \frac{D\bar{V}'}{Dt'} + \nabla p' = 0 \tag{4}$$

$$\nabla \times \bar{E} = 0 \tag{5}$$

$$\nabla \cdot \epsilon \bar{E} = 0, \tag{6}$$

where $p'$ is the total pressure given by

$$p' = P + \tfrac{1}{2}\epsilon E^2.$$


The  primes  denote  quantities  evaluated  in  the  rest  frame.  The  velocities  of  interest 
are  not  large  enough  to  require  any  distinction  between  frames  for  the  electric  fields. 

It  is  apparent  that  the  mechanics  would  be  considerably  simplified  by  viewing  the 
motion  from  the  rotating  frame.  For  this  reason,  Eqs.  3  and  4  will  now  be  transformed 
to  the  rotating  frame.  This  is  accomplished  by  the  transformation 

$$\begin{aligned}
r &= r' & V_1' &= V_1 \\
\theta &= \theta' - \Omega t & V_2' &= V_2 + \Omega r \\
z &= z' & p' &= p_0 + p \\
t &= t'
\end{aligned} \tag{7}$$


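If only linear terms are retained, the momentum and continuity equations require

$$p_0 = \tfrac{1}{2}\rho\Omega^2 r^2 + \text{constant} \tag{8}$$

$$\rho\left[\frac{\partial V_1}{\partial t} - 2\Omega V_2\right] + \frac{\partial p}{\partial r} = 0 \tag{9}$$

$$\rho\left[\frac{\partial V_2}{\partial t} + 2\Omega V_1\right] + \frac{1}{r}\frac{\partial p}{\partial \theta} = 0 \tag{10}$$

$$\rho\left[\frac{\partial V_3}{\partial t}\right] + \frac{\partial p}{\partial z} = 0 \tag{11}$$

$$\frac{1}{r}\frac{\partial (rV_1)}{\partial r} + \frac{1}{r}\frac{\partial V_2}{\partial \theta} + \frac{\partial V_3}{\partial z} = 0. \tag{12}$$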

The terms in these equations are those that would be observed by a rotating observer.
The usual techniques are now used to handle these equations. The total pressure is
used as a potential for deriving the other mechanical variables, while the axial
electric field is used to determine the remaining field components.

Because of the cylindrical geometry, the eigenvalues satisfying the axial boundary
conditions can be chosen from those given by the assumed solution

$$p = \hat{p}\, e^{j(\omega t + m\theta + kz)}. \tag{13}$$

Here, $\omega$ is the frequency as viewed in the moving frame. In all present considerations
care will be taken to excite and measure all frequencies in the moving frame.

If the surface of the interface is to be continuous azimuthally, the eigenvalue $m$ must
have integer values. The wave number $k$ will be determined in the usual way from the
boundary conditions in the axial direction.

The assumed solution reduces the bulk equations of motion given by Eqs. 5, 6, and
9-12 to a pair of ordinary differential equations that define all of the remaining functions.


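$$\frac{d^2\hat{p}}{dr^2} + \frac{1}{r}\frac{d\hat{p}}{dr} - \left[\frac{m^2}{r^2} + k^2\Lambda^2\right]\hat{p} = 0 \tag{14}$$

where

$$\Lambda^2 = 1 - \frac{4\Omega^2}{\omega^2},$$

and

$$\frac{1}{r}\frac{d}{dr}\left(r\frac{d\hat{e}_3}{dr}\right) - \left[\frac{m^2}{r^2} + k^2\right]\hat{e}_3 = 0. \tag{15}$$

Here, in terms of $\hat{p}$ and $\hat{e}_3$, the remaining functions are

$$\hat{V}_1 = \frac{j}{\rho\omega\Lambda^2}\frac{d\hat{p}}{dr} + \frac{2jm\Omega}{\rho\omega^2\Lambda^2}\left(\frac{\hat{p}}{r}\right) \tag{16}$$

$$\hat{V}_2 = -\frac{m}{\rho\omega\Lambda^2}\left(\frac{\hat{p}}{r}\right) - \frac{2\Omega}{\Lambda^2\rho\omega^2}\frac{d\hat{p}}{dr} \tag{17}$$

$$\hat{V}_3 = -\frac{k\hat{p}}{\rho\omega} \tag{18}$$

$$\hat{e}_2 = \frac{m}{k}\left(\frac{\hat{e}_3}{r}\right) \tag{19}$$

$$\hat{e}_1 = -\frac{j}{k}\frac{d\hat{e}_3}{dr}. \tag{20}$$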
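
The step from Eqs. 9-12 to Eq. 14 can be sketched as follows. This is a reconstruction under the assumed dependence $e^{j(\omega t + m\theta + kz)}$ of Eq. 13, with hats denoting the radial amplitudes; it is offered as a consistency check rather than as the derivation used in the report.

```latex
% Eqs. 9-11 with \partial/\partial t -> j\omega, \partial/\partial\theta -> jm, \partial/\partial z -> jk
\begin{aligned}
j\omega\rho\,\hat{V}_1 - 2\Omega\rho\,\hat{V}_2 &= -\frac{d\hat{p}}{dr}, \\
2\Omega\rho\,\hat{V}_1 + j\omega\rho\,\hat{V}_2 &= -\frac{jm}{r}\,\hat{p}, \\
j\omega\rho\,\hat{V}_3 &= -jk\,\hat{p}.
\end{aligned}

% The first two equations form a 2x2 system with determinant
% (j\omega\rho)^2 + (2\Omega\rho)^2 = -\rho^2\omega^2\Lambda^2,
% where \Lambda^2 = 1 - 4\Omega^2/\omega^2. Solving gives
\hat{V}_1 = \frac{j}{\rho\omega\Lambda^2}
            \left[\frac{d\hat{p}}{dr} + \frac{2m\Omega}{\omega}\frac{\hat{p}}{r}\right],
\qquad
\hat{V}_2 = -\frac{1}{\rho\omega\Lambda^2}
            \left[\frac{m\hat{p}}{r} + \frac{2\Omega}{\omega}\frac{d\hat{p}}{dr}\right],
\qquad
\hat{V}_3 = -\frac{k\hat{p}}{\rho\omega}.

% Substituting in Eq. 12,
% (1/r) d(r\hat{V}_1)/dr + (jm/r)\hat{V}_2 + jk\hat{V}_3 = 0,
% and multiplying by \rho\omega\Lambda^2/j:
\frac{1}{r}\frac{d}{dr}\left(r\frac{d\hat{p}}{dr}\right)
+ \frac{2m\Omega}{\omega r}\frac{d\hat{p}}{dr}
- \frac{m^2}{r^2}\hat{p}
- \frac{2m\Omega}{\omega r}\frac{d\hat{p}}{dr}
- k^2\Lambda^2\hat{p} = 0 .

% The Coriolis cross terms cancel, which leaves Eq. 14.
```

The rotation therefore enters the pressure equation only through the factor $\Lambda^2$ multiplying $k^2$.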


It is convenient to use as solutions to Eqs. 14 and 15 the Bessel function of first kind
and the Hankel function of first kind.

$$\hat{p} = A_1 J_m(jk\Lambda r) + A_2 H_m(jk\Lambda r) \tag{21}$$

$$\hat{e}_3 = C_1 J_m(jkr) + C_2 H_m(jkr). \tag{22}$$

However, we note the important fact that the arguments of these functions as
they occur in all of the mechanical variables depend on both the frequency $\omega$ and the
wave number $k$. This is the salient complication of the rotation.

The surface dynamics are represented in terms of the unit normal, $\bar{n}$, directed in
the positive $r$ direction in equilibrium, and the surface displacement $\xi(\theta, z, t)$ from an
equilibrium position $R$.

$$\frac{D}{Dt'}\left[r - (\xi(\theta, z, t) + R)\right] = 0. \tag{23}$$

Remember that $\xi$ is the surface perturbation as viewed in the fixed frame. Hence, Eq. 23
must be transformed to the rotating frame, in which it takes on the expected nonconvective
form

$$\hat{V}_1 = j\omega\hat{\xi}. \tag{24}$$

The normal vector does not depend on rates of change with respect to time, and is
therefore unaffected by the transformation to the rotating frame of reference. Hence

$$n_2 = -\frac{m}{R\omega}\hat{V}_1, \qquad n_3 = -\frac{k}{\omega}\hat{V}_1. \tag{25}$$


The  boundary  conditions  are  given  here  for  convenience.  They  are  of  the  same  form 
as  those  used  in  conjunction  with  variables  defined  in  the  inertial  frame  because  the 
transformation  of  all  of  the  variables  involved  in  the  boundary  conditions  (see  Eq.  7)  is 
one  to  one.  Hence,  the  definition  of  the  interface  and  the  conservation  of  momentum 
at  the  interface  require  that 


$$\bar{n} \cdot \left[\bar{V}^{(2)} - \bar{V}^{(1)}\right] = 0 \tag{26}$$


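$$n_\alpha T\left[-\frac{1}{R} + \frac{\xi}{R^2} + \frac{1}{R^2}\frac{\partial^2 \xi}{\partial\theta^2} + \frac{\partial^2 \xi}{\partial z^2}\right] + \left[T_{\alpha\beta}^{(2)} - T_{\alpha\beta}^{(1)}\right] n_\beta = 0, \tag{27}$$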


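where $T$ is the surface tension, and $T_{\alpha\beta}$ is the total stress tensor given by

$$T_{\alpha\beta} = \epsilon E_\alpha E_\beta - \delta_{\alpha\beta}\left[\tfrac{1}{2}\epsilon E_\gamma E_\gamma + p\right].$$

Also, the tangential component of the electric field must be continuous at the interface,

$$\bar{n} \times \left[\bar{E}^{(2)} - \bar{E}^{(1)}\right] = 0. \tag{28}$$

If there is no free charge on the interface, a type I interaction requires the use of the
additional condition

$$\bar{n} \cdot \left[\epsilon^{(2)}\bar{E}^{(2)} - \epsilon^{(1)}\bar{E}^{(1)}\right] = 0. \tag{29}$$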


(29) 
At $r = a, b$, the normal velocity must vanish because of the condition imposed by a rigid
boundary. In the EH-II interaction it is assumed that these boundaries are composed of
a dielectric material so that it is possible to support an axial electric field that is
uniform in the unperturbed state. The walls are assumed to have the same dielectric
properties as the adjacent fluid dielectric, so that there is no polarization charge or Korteweg
current on the rigid boundaries. On the other hand, the type I interactions are assumed
to result from equipotentials at $r = a, b$; and this requires that $\hat{e}_3 = 0$ there.

4.  Dispersion  Equations 

The technique of finding the dispersion equations that are consistent with
perturbations of the form of Eq. 13 is now straightforward. Equations 16-20, 24, and 25 make
it possible to evaluate all of the dependent variables in terms of the solutions given by
Eqs. 21 and 22, that is, in terms of eight constants, $A_1^{(1)}$, $A_2^{(1)}$, $A_1^{(2)}$, $A_2^{(2)}$,
$C_1^{(1)}$, $C_2^{(1)}$, $C_1^{(2)}$, $C_2^{(2)}$. (The superscripts denote the region in which the variable is to be evaluated.)

The dispersion relation for each of the problems is the compatibility condition of
these eight constants in the equations resulting when the solutions are substituted in the
boundary conditions of Eqs. 26-29.

The cylindrical geometry poses the complication of a radial dependence for the
equilibrium pressure field; and in the type I interactions, a radial dependence for the
equilibrium electric fields. Hence, in this linear analysis, these respective fields must be
written in the form

$$P = \text{constant} + \tfrac{1}{2}\rho\Omega^2 (R + \xi)^2 + p(r, \theta, z, t) \approx \text{constant} + \tfrac{1}{2}\rho\Omega^2 (R^2 + 2R\xi) + p(r, \theta, z, t) \tag{30}$$

$$E_1 = \frac{v}{R + \xi} + e_1 \approx \frac{v}{R} - \frac{v\xi}{R^2} + e_1. \tag{31}$$

Investigation  shows  that  for  the  EH-If  interaction,  Eqs.  26-29  are  linear  combina¬ 
tions  of  Eqs.  26,  (27)j,  (27)j,  {ZS)^  and  the  condition  that  Vj  =  0  at  r  =  a,b.  One  of 
these  equations  ceases  to  be  independent  in  the  limit  where  there  is  no  free  charge  on 
the  interface.  In  this  case  (the  EH-Ip  interaction)  Eq.  (27)^  is  replaced  by  Eq.  29. 

The  EH-II  interaction  is  defined  by  Eqs .  26,  (27)j,  (27 )y  28,  and  the  condition  that 
$V_1 = 0$ at $r = a, b$. It is assumed that all electromagnetic perturbation fields are finite
at infinity and at the origin. This last condition replaces two equations, in that it requires
$C_2^{(1)} = C_1^{(2)} = 0$. The dispersion equations for each of the three problems can be
summarized as follows:


$$\omega^2 = \frac{T}{R^3 \rho_{eq}}\left\{(kR)^2 - (1 - m^2) + F + G\right\}, \tag{32}$$

where

f  -BV(p<  V) 


,0) 


(2)x 


req 


A|  p'1'Pm(jkAa,  jkAR)-p'  -'Pm(jkAb,  jkAR) 


Pm<x'y)  =lkR 


Jm(y)  Lm<x>  ‘  Hm(y> 


m'  '  m' 


Lm(x)  Qm<y)  -  Lm(y) 


m'  m' 


Vx>  =  + 


Lm(x)  -  H|n(x)  + 


2mfl 

J  (x) 
m  ' 

U> 

X 

2mfi 

Hm<x> 

(j) 

X 

Sm(kR,  kb)  =  jkR 
Tm(kR,  ka)  =  jkR 


Hm(jkR)  Jm(jkb)  ~  Hm(jkb)  Jm(jkR)  \ 

Hm(jkR)  Jm(jkb)  “  Jm(jkR>  Hm(jkb)  J 

Jm(jkR)  Hm(jka)  ~  Jm(jka)  Hm(3kR>~ 

Jm(JkR)  Hm<jka)  ~  Jm(jka)  Hm(3kR)_ 


for all of the problems, but $G$ depends on the type of interaction that is
considered.

EH-If

$$G = \Gamma^{(2)}\left[S_m(kR, kb) + 1\right] - \Gamma^{(1)}\left[T_m(kR, ka) + 1\right]$$

$$\Gamma = \frac{v^2 \epsilon}{RT},$$

EH-Ip 


G  =■ 


1 

RT" 

«(D 

6(2) 

[sm(kR.kb) 

Tm(kR,  ka) 
EH-II


G  =  — 


(€(2)_£(l)^(Rk)2  e2r 


(1)  Jm<JkR)  (2) 
e'^jkR)  — - e(Z,(jkR)  — - 


Jm(jkR) 


Hm(jkR) 


If modes of motion are considered that have no $z$ dependence, then the dispersion
equations take a simple and convenient form that is quadratic in the frequency $\omega$. To
show this, it is recognized that as $k \to 0$ (or as perturbations in the axial direction assume
an infinite wavelength),

Tm(kR,  ka) 


Sm(kR,  kb) 


(f)2”  *  i 
(£)2m-  ■ 


Pra(*,y) 


(*♦?) 

J 

y  \2m  , 

i 

_ !_ 

+  \ 

JJ  ' 

1  u) 

mA 

'yN 

i2m 

-1 

(33) 


and  the  dispersion  equations  become 

A  Os)**  +  2B(|r)  +  C  +  (1_m2)  "  F  =  (34) 

where 


F 

'(f)2m  * 

o(1)- 

ro 

3 

+ 

*-* 

_ 1 

0<2)> 

.(fr  -  *. 

P 

,(#)2m  -  ■. 

P  * 

» ■  (£)■ 
EH-If

c  =  (r(1)-r(2))  +  m 

EH-Ip 


c  =  (r(1)-r(2)) - 

RT 

EH-II 

C  =  0. 

Note that for a given mode (a given integer value of $m$) $A$, $B$, and $C$ are constants
fixed by the physical parameters and the geometry. For each mode, there are two
possible frequencies of oscillation. In the limit $\Omega \to 0$, these two values of $\omega$ approach the
same value. Hence, the effect of the rotation is to split the possible resonant frequencies
into disparate pairs. The EH-II waves couple to the electric field only if the direction
of propagation is in the direction of the equilibrium electric field.² It is therefore not
surprising that the electric field has no influence on the resonant frequency of the type II
resonator if there is no axial dependence of the surface deformations.

5.  Experiment 

The EH-If waves discussed in the previous sections are being investigated by using
the apparatus shown in Fig. VIII-6. The detailed drawing shows the resonator that is
rotated with the frequency $\Omega$. This rotation is provided by a dc motor equipped with a
feedback loop for speed control. (The measurement of the resonant frequency $\omega$ is
sensitive enough to require a speed controlled to four significant figures.) The speed is
measured within 1/8 of a revolution with a 10-sec sample time, as shown in Fig. VIII-6.
An excitation of the surface waves was provided by torque pulsations from a dc motor
with its field excited by a square wave having a repetition rate $\omega/2\pi$. Because the
oscillator that generated the square wave operates at 16 times the frequency $\omega$, it is possible
to measure the excitation frequency to four places, by using a 10-sec sample time.

The low-frequency $m = 1$ and $m = 2$ modes have been measured by using a cylinder
of length 8.75 in. with $a = 0.5$ in., $b = 1.867$ in., and $R = 1.31$ in. Although frequency
shifts with electric field have been observed (they are small), attention has been
confined thus far to attempting to understand the waves without field coupling. Figure VIII-7


(2) 

‘(£)2"  *  •’ 

+  r(1) 

(t)2”  *  >1 

_(ir)2m  -  *. 

(f-r  -  *J 

m2v(2)v(1)(e(1)-€(2))2 
shows typical data with no electric field. It can be seen that the experiment justifies the
mechanical model used (at least for the prediction of the resonant frequency). The solid
lines were drawn by using Eq. 34. The surface waves are excited on the interface of
the fluid by a thin rubber "flapper" that is attached to the outer wall as shown in
Fig. VIII-6. The rotational pulsations of the cylinder then set the fluid into a slight
motion which is observed under stroboscopic light. An extremely sensitive indication
of resonance is given when the resonant motions of the interface are accompanied by no
deflection of the flapper. It is at this frequency that the flapper does not inhibit the
motion of the fluid, and hence does not "pull" the resonant frequency.

Fig. VIII-6. Experimental arrangement for study of EH
surface waves on rotating interface.

Fig. VIII-7.
(a) Low-frequency m = 1, k
mode (no electric field).
(b) Low-frequency m = 2, k
mode (no electric field).

J.  R.  Melcher 
D. MAGNETOACOUSTIC-WAVE EXPERIMENT
1. Introduction

It has been shown by Haus¹ that it is theoretically possible to use magnetoacoustic
waves to convert kinetic power of a flowing plasma to ac electrical power by means of
an external, magnetically coupled circuit. In order for this amplifying interaction to
occur, the following conditions must be satisfied: (a) the plasma must be flowing through
a region of constant applied transverse magnetic field; (b) the flow velocity must be
greater than the magnetoacoustic wave velocity; and (c) the plasma electrical
conductivity must be sufficiently large to permit the waves to amplify as they propagate.

An  experiment  has  been  devised  to  try  to  produce  a  plasma  in  a  system  satisfying 
these  conditions.  The  system  is  a  homopolar  device  (see  Fig.  VIII-8)  in  which  a 
gas  is  heated  and  accelerated  electrically  in  a  uniform  externally  applied  magnetic  field. 
We  are  trying  to  determine  whether  or  not  the  resulting  rotating  plasma  will  satisfy  the 
above-given  conditions. 

In recent years there has been a growing interest in rotating plasmas for such uses
as fast discharge capacitors,² arc gas heaters for wind-tunnel or materials studies,³
and for basic studies in plasma physics.⁴ In each of the cases reported the magnetic
field was so large that the flow velocity was less than the magnetoacoustic-wave
propagation velocity. Furthermore, there were no attempts to study magnetoacoustic waves.
Nevertheless these studies have provided a useful starting point and some theoretical
framework for the experiments reported on here. Some initial experiments will be
described in which the objective was to produce a plasma satisfying the conditions
discussed above.

Fig. VIII-8. Homopolar device.

2.  Description  of  Experiment 

The apparatus for the experiment is shown in Fig. VIII-9. The channel (Fig. VIII-10)
consists of two concentric circular brass rings. The top and bottom cover plates are
either glass or Plexiglas. The inside surfaces of the brass electrodes are notched near
the insulating walls and coated with Sauereisen, a high-temperature porcelainlike
material, in an attempt to confine the plasma away from the insulating walls. The electrode
currents are carried by 4 radial arms that join at the axis beneath the electrodes. The
chamber is sealed by 4 neoprene O-rings. No external support is required for the cover
plates once the chamber is evacuated. The axial magnetic field is produced by two
concentric coils connected in series; the field in the region between the coils is uniform
within ±5 per cent. Five taps are provided on each coil to vary the amount of inductance
from 3-30 mh. The energy for powering the field coils and the plasma is supplied by
an 800-μf, 4-kv capacitor bank. The duration of an experiment is 10 msec, or less,
depending on the parameters of the system, so that no external cooling is necessary
for either the coils or channel. The switching elements used in the experiments are
type 5550 ignitrons, which are air-cooled and rated at 12 kv and 20 coulombs for pulse
work,⁵ or, for example, 20,000 amps for 1 msec. The ignitron igniters are excited
by 5C22 hydrogen thyratrons that are fired by delay circuits and pulse-forming
networks. See Fig. VIII-11.

The channel is evacuated to approximately 1 μ Hg pressure before each firing and backfilled to the desired pressure with the working fluid. Most of the experiments were performed with helium in the pressure range 0.5-10 mm Hg. At pressures much outside this range it was not possible to break down the gas with the available capacitor bank when an external magnetic field was applied.

Fig. VIII-9. Rotating-plasma apparatus.

Several channel configurations had been used in previous experiments, which formed the basis for the present design. Two circuit configurations have been studied: (a) a series connection in which the field coils are in series with the plasma, and (b) separately excited circuits in which the channel and field coils are pulsed from separate capacitor banks. The series arrangement has the advantages that it (a) uses the coil inductance for current limiting, (b) provides a reasonably long discharge time, and (c) produces high magnetic fields. A peak discharge current of 1400 amps and a magnetic field of 15,000 gauss are possible with a discharge time of 5 msec. At lower field and discharge currents an arc is observed to accelerate azimuthally until a point is reached at which the arc suddenly switches mode and becomes diffuse for most of the remainder of the discharge. The current, voltage, and light intensity are illustrated in Fig. VIII-12. The only noted effect of pressure changes in the range 0.5-10 mm Hg was that the plasma did not remain diffuse as long during the discharge time at the higher pressures. From the voltage and current traces the voltage is quite nearly proportional to the current. This would be true, for example, if the voltage were produced by a v × B back emf and we assume a limiting velocity for the plasma. The velocity corresponding to the values given is of the order of the ionization energy velocity, a result shown by others. Attempts to short-circuit the plasma to measure the charge stored in the plasma capacitor, however, did not show any measurable stored charge.

Fig. VIII-10. Channel with field coils removed.

Fig. VIII-11. Typical ignitron firing circuit.

If the inductance of the field coils is reduced, then with a peak field of 6500 gauss and a discharge current of 600 amps, short-circuiting the plasma resulted in the recovery of approximately 100 μcoulomb of charge. This is equivalent to an ionization of
0. 1  per  cent  (or  n.  =  lO1^  ions/cc)  if  it  is  assumed  that  the  rotational  energy  of  the  ions 
is  of  the  order  of  the  ionization  energy.  Observations  of  the  light  intensity  indicate  that 
at  the  higher  currents  and  magnetic  fields  the  rotating  arc  phase  of  the  discharge  no 
longer  appears. 

With  the  separately  excited  configuration  used  and  with  a  current -limiting  resistor 
of  0.6  ohm  in  the  plasma  circuit,  experiments  indicate  that  the  plasma  again  exhibits 
capacitive  properties  (Fig.  VIII-12c).  In  these  experiments  the  magnetic  field  is  first 
established  by  part  of  the  capacitor  bank  and  the  remainder  of  the  bank  is  discharged 
into  the  channel  when  the  field  is  maximum. 

At  the  present  time  we  are  proceeding  with  instrumentation  to  determine  conditions 
experimentally  in  the  rotating  plasma.  Attempts  to  launch  magnetoacoustic  waves  in 
the  rotating  plasma  have  been  inconclusive. 

F. D. Ketterer

A. GENERALIZATION OF THE ERROR CRITERION IN NONLINEAR THEORY BASED ON THE USE OF GATE FUNCTIONS

In the Wiener theory of optimum nonlinear systems, the measure of performance is the mean-square error and the input is a Gaussian process. A reason for the choice of this error criterion and type of input is the resulting relative analytical and experimental simplicity by which a nonlinear system can be determined. Recently, experimental procedures have been studied by which one can determine optimum nonlinear systems for error criteria other than the mean-square error criterion and inputs other than a Gaussian process.1,2 The basic procedure is to expand the class of systems of interest into a complete set of operators, $\mathcal{H}_n$, so that the output of any system of this class can be expressed as

$$y(t) = \sum_{n=1} a_n\, y_n(t) \tag{1}$$

in which

$$y_n(t) = \mathcal{H}_n[x(t)]. \tag{2}$$

Figure IX-1 is a schematic representation of Eq. 1. In this representation, the coefficients, $a_n$, are amplifier gains. The determination of an optimum system is thus reduced to the determination of these coefficients. The procedure is then to measure experimentally the desired function of the error and simply to adjust the amplifier gains in order to minimize this quantity. This procedure is guaranteed to result in the optimum system of the class being represented for convex error criteria, since we are then guaranteed that there is a unique minimum of the function of the error and that there are no local minima. A difficulty with this method is that, in general, the amplifier gain settings are not independent of one another and therefore an iterative adjustment procedure is required. This interdependence of the amplifier gain settings also makes any estimation of the optimum gain settings by analytical techniques too unwieldy to be obtained in any practical manner. Also, one generally has difficulty in obtaining a reasonable estimate of the class of nonlinear systems to represent. However, there is a set of operators based on the gate functions which can be used to expand the desired system and with which, for any error criterion, the amplifier gain settings do not interact. In this report, we shall present some experimental and analytical techniques by which the amplifier gains can be determined when this set of operators is used. To explain the techniques, the determination of optimum nonlinear no-memory systems for various error criteria will be presented. The extension to nonlinear systems with memory will then be given.

Fig. IX-1. Operator expansion of a system.

*This work was supported in part by the National Institutes of Health (Grant MH-04737-02); and in part by the National Science Foundation (Grant G-16526).


1.  The  Gate  Functions 

Fig. IX-2. Transfer characteristic of a gate function.

For nonlinear no-memory systems, the set of operators that we shall use is the gate functions, $Q_n$. These functions were first introduced into the study of nonlinear systems by Bose.3 For this set of operators and input, $x(t)$, the outputs, $y_n(t)$, are

$$
\begin{aligned}
y_0(t) &= Q_0[x(t)] = \begin{cases} 1 & \text{if } -\infty < x(t) < x_1 \\ 0 & \text{otherwise} \end{cases} \\
y_n(t) &= Q_n[x(t)] = \begin{cases} 1 & \text{if } x_n \le x(t) < x_{n+1};\ n \ne 0, N \\ 0 & \text{otherwise} \end{cases} \\
y_N(t) &= Q_N[x(t)] = \begin{cases} 1 & \text{if } x_N \le x(t) < \infty \\ 0 & \text{otherwise} \end{cases}
\end{aligned}
\tag{3}
$$

in which $x_m > x_n$ if $m > n$. Figure IX-2 is a plot of the $n$th gate function. Any step approximation of a nonlinear no-memory system can be expressed in terms of this set as

$$y(t) = \sum_{n=0}^{N} a_n Q_n[x(t)], \tag{4}$$

in which we have not placed any restrictions on $x(t)$.
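The gate-function expansion of Eq. 4 can be illustrated numerically. The following is a minimal sketch, assuming NumPy; the function names `gate_outputs` and `step_approximation`, the threshold values, and the gains are illustrative choices and do not come from the report.

```python
import numpy as np

def gate_outputs(x, thresholds):
    """Return Q of shape (N+1, len(x)); Q[n, i] is 1 if x[i] lies in the
    interval of the nth gate and 0 otherwise.

    thresholds holds the increasing values x_1 < ... < x_N. Gate 0 covers
    (-inf, x_1), gate n covers [x_n, x_{n+1}), and gate N covers [x_N, inf).
    """
    x = np.asarray(x, dtype=float)
    # side="right" assigns a sample equal to x_n to gate n, matching Eq. 3.
    idx = np.searchsorted(thresholds, x, side="right")
    q = np.zeros((len(thresholds) + 1, x.size))
    q[idx, np.arange(x.size)] = 1.0
    return q

def step_approximation(x, thresholds, gains):
    """Evaluate y = sum_n a_n Q_n[x] (Eq. 4) for amplifier gains a_n."""
    return np.asarray(gains, dtype=float) @ gate_outputs(x, thresholds)

# Hypothetical example values (N = 3, so four gates and four gains).
thresholds = np.array([-1.0, 0.0, 1.0])
gains = np.array([-2.0, -0.5, 0.5, 2.0])
x = np.array([-3.0, -1.0, -0.2, 0.0, 0.7, 1.0, 5.0])
y = step_approximation(x, thresholds, gains)
```

Because exactly one gate is active for each sample, the output at any instant is simply the gain of the active gate.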

Before proceeding, we note four important properties of the gate functions. The first is

$$\sum_{n=0}^{N} Q_n[x(t)] = 1. \tag{5}$$

The second is

$$Q_n^P[x(t)] = Q_n[x(t)] \quad \text{for } P > 0. \tag{6}$$

The third is

$$\prod_{n=1}^{P} Q_{\alpha_n}[x(t)] = 0 \quad \text{unless } \alpha_1 = \alpha_2 = \ldots = \alpha_P. \tag{7}$$

Then, by use of Eqs. 6 and 7, we obtain the fourth property, which is

$$\left[\sum_{n=0}^{N} a_n Q_n[x(t)]\right]^P = \sum_{n=0}^{N} a_n^P\, Q_n[x(t)]. \tag{8}$$

2. Optimum No-Memory Systems for Minimum $\overline{|E(t)|^P}$

In order to illustrate some of the properties of the gate functions and also methods for determining the coefficients, $a_n$, we first shall discuss the determination of optimum nonlinear no-memory systems for error criteria that are the mean $P$th power of the magnitude of the error, $E(t)$. That is, the error criteria that we first shall discuss are $\|E(t)\| = \overline{|E(t)|^P}$ for $P = 1, 2, 3, \ldots$. We shall then generalize these results to error criteria that are arbitrary functions of the error.

For the input $x(t)$, let the desired output be $z(t)$. The error is then

$$E(t) = z(t) - \sum_{n=0}^{N} a_n Q_n[x(t)]. \tag{9}$$

For convenience, we shall omit writing the argument $t$. Thus we shall write the error as given by Eq. 9

$$E = z - \sum_{n=0}^{N} a_n Q_n(x) \tag{10}$$

with the understanding that $E$, $x$, and $z$ are functions of time. Then the $P$th power of the error is

$$E^P = \left[z - \sum_{n=0}^{N} a_n Q_n(x)\right]^P. \tag{11}$$
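We shall obtain a more convenient form of this expression. Equation 11 can be expanded as

$$E^P = z^P + c_1 z^{P-1}\left[\sum_{n=0}^{N} a_n Q_n(x)\right] + c_2 z^{P-2}\left[\sum_{n=0}^{N} a_n Q_n(x)\right]^2 + \ldots + c_P\left[\sum_{n=0}^{N} a_n Q_n(x)\right]^P \tag{12}$$

in which the coefficients, $c_n$, are the binomial coefficients (taken with the alternating signs of the expansion). Substituting Eq. 8 in Eq. 12, we then have

$$E^P = z^P + c_1 z^{P-1}\sum_{n=0}^{N} a_n Q_n(x) + c_2 z^{P-2}\sum_{n=0}^{N} a_n^2 Q_n(x) + \ldots + c_P\sum_{n=0}^{N} a_n^P Q_n(x). \tag{13}$$

Equation 13 can be written in the form

$$
\begin{aligned}
E^P &= z^P + \sum_{n=0}^{N}\left[c_1 z^{P-1} a_n + c_2 z^{P-2} a_n^2 + \ldots + c_P a_n^P\right] Q_n(x) \\
&= z^P + \sum_{n=0}^{N}\left[(z-a_n)^P - z^P\right] Q_n(x) \\
&= z^P\left[1 - \sum_{n=0}^{N} Q_n(x)\right] + \sum_{n=0}^{N} (z-a_n)^P Q_n(x).
\end{aligned}
\tag{14}
$$

By use of Eq. 5, we then have

$$E^P = \sum_{n=0}^{N} [z-a_n]^P\, Q_n(x). \tag{15}$$

The magnitude of the $P$th power of the error can be expressed in the form

$$|E|^P = \left[E^{2P}\right]^{1/2}. \tag{16}$$

Thus, from Eq. 15 we have

$$\overline{|E|^P} = \overline{\left[\sum_{n=0}^{N} (z-a_n)^{2P}\, Q_n(x)\right]^{1/2}} \tag{17}$$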


in which the bar indicates the time average of the function.

The optimum set of coefficients, $a_n$, for which Eq. 17 is a minimum can be determined by setting the derivative with respect to $a_j$ equal to zero. Thus, from Eq. 17 we have

$$\frac{\partial\, \overline{|E|^P}}{\partial a_j} = -P\;\overline{\left[\frac{(z-a_j)^{2P-1}\, Q_j(x)}{|E|^P}\right]}. \tag{18}$$


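The numerator in Eq. 18 is zero except when the amplitude of $x(t)$ falls in the interval of the $j$th gate function. At such times, the denominator is equal to $\left[(z-a_j)^{2P} Q_j(x)\right]^{1/2}$. Thus we can rewrite Eq. 18 to obtain

$$\frac{\partial\, \overline{|E|^P}}{\partial a_j} = -P\;\overline{\frac{(z-a_j)^{2P-1}}{|z-a_j|^{P}}\, Q_j(x)} = -P\;\overline{(z-a_j)\,|z-a_j|^{P-2}\, Q_j(x)}. \tag{19}$$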


However,

$$(z-a_j) = |z-a_j|\,\mathrm{Sgn}(z-a_j) \tag{20}$$

in which

$$\mathrm{Sgn}(z-a_j) = \begin{cases} 1 & \text{if } z > a_j \\ 0 & \text{if } z = a_j \\ -1 & \text{if } z < a_j. \end{cases} \tag{21}$$


Substituting Eq. 20 in Eq. 19, we find that the condition for $\overline{|E|^P}$ to be a minimum is

$$\overline{|z-a_j|^{P-1}\,\mathrm{Sgn}(z-a_j)\, Q_j(x)} = 0. \tag{22}$$

We can show that Eq. 22 is the condition for a minimum by differentiating Eq. 18 with respect to $a_j$ and substituting Eq. 22 in the result. Then we obtain

$$\frac{\partial^2\, \overline{|E|^P}}{\partial a_j^2} = (2P-1)\,P\;\overline{\left[\frac{(z-a_j)^{2(P-1)}\, Q_j(x)}{|E|^P}\right]} \ge 0. \tag{23}$$


Equation 23 is positive, since $P \ge 1$ and all terms being averaged are always positive. Thus Eq. 22 is the condition for a minimum. We note that, by using the gate functions, the amplifier gain settings do not interact and they can be determined individually by means of Eq. 22.
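Because each coefficient is fixed by its own condition, Eq. 22 can be solved one gate at a time. The sketch below is one possible numerical counterpart of that condition, assuming NumPy and sampled records of the desired output; the function name `optimum_gain` and the use of bisection are choices made here for illustration and are not part of the report, which describes an analog measurement instead.

```python
import numpy as np

def optimum_gain(z, in_gate, p, tol=1e-10):
    """Solve mean(|z-a|^(p-1) * sgn(z-a) * Q_j) = 0 for a (Eq. 22) by bisection.

    z       : samples of the desired output
    in_gate : boolean mask, True where x falls in the jth gate (Q_j = 1);
              at least one sample must be in the gate
    p       : exponent P >= 1 of the error criterion
    """
    zj = np.asarray(z, dtype=float)[np.asarray(in_gate, dtype=bool)]

    def g(a):
        # Averaging only over in-gate samples rescales the mean by a
        # positive constant, which does not move the root.
        d = zj - a
        return np.mean(np.abs(d) ** (p - 1) * np.sign(d))

    lo, hi = zj.min(), zj.max()  # g(lo) >= 0 >= g(hi), and g is nonincreasing
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For P = 2 the root is the average of z over the times at which the gate is active, and for P = 1 it is a median of those values, which gives a quick check on any implementation.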

Equation 22 provides a convenient experimental method for determining the desired coefficients, $a_n$. We note that if $P$ is even, then Eq. 22 can be written in the form

$$\overline{(z-a_j)^{P-1}\, Q_j(x)} = 0 \quad [P \text{ even}]. \tag{24}$$

A circuit for experimentally determining the value of $a_j$ that satisfies Eq. 24 is depicted in Fig. IX-3. The battery voltage, V, is adjusted to make the meter read zero; the voltage, V, is then numerically equal to the optimum value of $a_j$. Another procedure is to expand Eq. 24 as

$$\overline{(z-a_j)^{P-1}\, Q_j(x)} = \sum_{n=0}^{P-1} c_n\, \overline{z^n Q_j(x)}\; a_j^{(P-1-n)} = 0, \tag{25}$$

in which the coefficients, $c_n$, are the binomial coefficients. The averages, $\overline{z^n Q_j(x)}$, can be measured experimentally. The optimum value of $a_j$ can then be determined by substituting the measured values of $\overline{z^n Q_j(x)}$ in Eq. 25 and solving for the root of the resulting $(P-1)$-degree polynomial in $a_j$.
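The polynomial route of Eq. 25 can be sketched as follows, assuming NumPy and that the averages of $z^n Q_j(x)$ have already been measured. The function name `gain_from_moments` is hypothetical, and the sign convention for the coefficients (binomial coefficients with the alternating sign of the expansion of $(z-a_j)^{P-1}$ included) is an interpretation adopted here so that the polynomial agrees with Eq. 24.

```python
import numpy as np
from math import comb

def gain_from_moments(moments):
    """Solve Eq. 25 for a_j given moments[n] = mean(z**n * Q_j(x)), n = 0..P-1.

    Expanding (z - a)^(P-1) gives sum_n c_n * moments[n] * a^(P-1-n) with
    c_n = C(P-1, n) * (-1)^(P-1-n).
    """
    m = len(moments) - 1  # m = P - 1, which is odd when P is even
    coeffs = [comb(m, n) * (-1) ** (m - n) * moments[n] for n in range(m + 1)]
    roots = np.roots(coeffs)  # coefficients are ordered highest power first
    # For even P the averaged polynomial is monotonic in a, so there is a
    # single real root; pick the root with the smallest imaginary part.
    return float(roots[np.argmin(np.abs(roots.imag))].real)
```

For P = 2 this reduces to the ratio of the first moment to the zeroth moment, that is, the average of z while the gate is active.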


Fig. IX-3. A circuit for determining $a_j$ for P even.

If $P$ is odd, then there are two basically different circuits by which the desired value of $a_j$ can be determined. One is obtained by noting that

$$|z-a_j|^{P-1}\,\mathrm{Sgn}(z-a_j) = (z-a_j)^{P-2}\,|z-a_j| \quad [P \ge 3 \text{ and odd}]. \tag{26}$$

A circuit, based on Eq. 26, for experimentally determining the desired value of $a_j$ is depicted in Fig. IX-4. The second circuit is obtained by noting that a system whose output is $\mathrm{Sgn}(z-a_j)$ is a saturating amplifier. Thus, for $P$ odd, a second circuit is shown in Fig. IX-5. We note that the multiplier in Fig. IX-5 need only be a polarity-reversing switch that is controlled by the output of the saturating amplifier. In the special case for $P = 1$, the coefficients for the system with a minimum mean magnitude error are obtained. The circuit for determining these coefficients is relatively simple and is depicted in Fig. IX-6.


3. Optimum No-Memory Systems for Minimum $\overline{F[E(t)]}$

The procedure that we have just described can be extended to the determination of optimum nonlinear no-memory systems for error criteria that are arbitrary functions of the error. To extend this procedure, we have from Eq. 10

$$F[E] = F\left[z - \sum_{n=0}^{N} a_n Q_n(x)\right]. \tag{27}$$


According to the definition of the gate functions (Eqs. 3), at any instant only one term in the sum is nonzero. If, at that instant, $x(t)$ is in the interval of the $j$th gate function, then, at that instant, $F(E) = F(z-a_j)\,Q_j(x)$. Thus, by the use of Eq. 5, we can express Eq. 27 as

$$F[E] = \sum_{n=0}^{N} F[z-a_n]\, Q_n(x). \tag{28}$$

Equation 28 is identical with Eq. 15 for $F(E) = E^P$. The set of coefficients for which the mean of Eq. 28 is stationary now can be determined by setting the derivative of $\overline{F(E)}$ with respect to $a_j$ equal to zero. Thus, if we define

$$G(\alpha) = \frac{d}{d\alpha}F(\alpha), \tag{29}$$

then the condition that $a_j$ be an extremal of the mean of Eq. 28 is

$$\overline{G(z-a_j)\, Q_j(x)} = 0. \tag{30}$$


A sufficient condition that assures us that the condition given by Eq. 30 yields a minimum of $\overline{F(E)}$ is that

$$\frac{d^2}{d\alpha^2}F(\alpha) \ge 0. \tag{31}$$

Equation 31 is the statement that the error criterion is a convex function of its argument. For such cases, a circuit for experimentally determining the value of $a_j$ that satisfies Eq. 30 is depicted in Fig. IX-7. The battery voltage, V, in the figure is adjusted to make the meter read zero; V is then numerically equal to the optimum value of $a_j$. If $F(E)$ is not a convex function of its argument, this simple procedure is not sufficient, for local minima and maxima can then exist. However, the adjustment of any one coefficient affects only one term of the sum in Eq. 28. The minimum of $\overline{F(E)}$ is thus obtained only if each term in Eq. 28 is a minimum. Therefore, because of the special properties of the gate functions, the amplifier gains, $a_n$, do not interact even for arbitrary functions of the error and thus they can be determined individually. This affords enormous simplification in search procedures for the determination of optimum systems for nonconvex functions of the error. However, if $G(\alpha)$ can be expressed as a polynomial, then the values of $a_j$ which satisfy Eq. 30 can be determined without a search procedure. For, if $G(\alpha)$ can be expressed as a polynomial, then from Eq. 30 a polynomial expression in $a_j$ can be obtained. The coefficients of the various powers of $a_j$ will involve only the averages $\overline{z^n Q_j(x)}$, which can be measured experimentally. The values of $a_j$ which satisfy Eq. 30 can then be determined by solving for the roots of the polynomial in $a_j$. Thus, we have the important result that for the cases in which $F(E)$ can be expressed as a polynomial, no search procedure is required; then the optimum value of $a_j$ can be determined analytically by solving for the minimum of the polynomial, $\overline{F(z-a_j)\, Q_j(x)}$.


4.  An  Example 


An  important  consequence  of  the  fact  that  the  amplifier  gains  do  not  interact  is  that, 
in  many  cases,  analytical  estimates  of  the  desired  optimum  nonlinear  system  can  be 
obtained.  We  shall  consider  a  simple  example  as  an  illustration.  A  received  signal 
is  the  sum  of  a  message  and  a  noise  that  are  statistically  independent.  The  optimum 
nonlinear  no-memory  system  is  to  be  determined  for  an  output  that  is  the  message. 

The error criterion to be used is $\overline{|E|^P}$ in which $P > 1$, but not necessarily an integer. The message is a binary signal that assumes the value 1 with probability 1/2 or the value -1 with probability 1/2. We shall determine the coefficients of the optimum system from Eq. 30. To do this, let the random variable of amplitude of the message be $\zeta$ which takes on values $z$; let the random variable of amplitude of the noise be $\eta$ which takes on values $y$. Then, the random variable of amplitude of the received signal, $\xi$, is

$$\xi = \zeta + \eta. \tag{32}$$

On the ensemble basis, Eq. 30 for our example is

$$\overline{G(\zeta-a_j)\, Q_j(\xi)} = 0. \tag{33}$$

For our problem, $F(\alpha) = |\alpha|^P$ so that from Eq. 29

$$G(\alpha) = P\,|\alpha|^{P-1}\,\mathrm{Sgn}(\alpha). \tag{34}$$

Thus Eq. 33 becomes

$$
\begin{aligned}
0 &= \overline{|\zeta-a_j|^{P-1}\,\mathrm{Sgn}(\zeta-a_j)\, Q_j(\zeta+\eta)} \\
&= \int_{-\infty}^{\infty} dy \int_{-\infty}^{\infty} dz\; |z-a_j|^{P-1}\,\mathrm{Sgn}(z-a_j)\, Q_j(z+y)\, P_{\zeta,\eta}(z,y).
\end{aligned}
\tag{35}
$$

However, since the message is binary and independent of the noise, we have

$$P_{\zeta,\eta}(z,y) = P_\zeta(z)\, P_\eta(y) = \tfrac{1}{2}\left[u(z-1) + u(z+1)\right] P_\eta(y), \tag{36}$$


in which $u(z)$ is the unit impulse function. Substituting Eq. 36 in Eq. 35 and integrating with respect to $z$, we obtain

$$
\begin{aligned}
0 = {} & \tfrac{1}{2}\,|1-a_j|^{P-1}\,\mathrm{Sgn}(1-a_j) \int_{-\infty}^{\infty} Q_j(y+1)\, P_\eta(y)\, dy \\
& + \tfrac{1}{2}\,|-1-a_j|^{P-1}\,\mathrm{Sgn}(-1-a_j) \int_{-\infty}^{\infty} Q_j(y-1)\, P_\eta(y)\, dy.
\end{aligned}
\tag{37}
$$

The equation that $a_j$ must satisfy is thus

$$-\frac{|1-a_j|^{P-1}\,\mathrm{Sgn}(1-a_j)}{|1+a_j|^{P-1}\,\mathrm{Sgn}(-1-a_j)} = \frac{\int_{-\infty}^{\infty} Q_j(y-1)\, P_\eta(y)\, dy}{\int_{-\infty}^{\infty} Q_j(y+1)\, P_\eta(y)\, dy}. \tag{38}$$

We note that the right-hand side of the equation is the ratio of the probability that $(\eta-1)$ is in the interval of the $j$th gate function to the probability that $(\eta+1)$ is in the interval of the $j$th gate function. Let us denote this ratio, which is positive, by $\beta_j$. We then have

$$\frac{|1-a_j|^{P-1}\,\mathrm{Sgn}(1-a_j)}{|1+a_j|^{P-1}\,\mathrm{Sgn}(-1-a_j)} = -\beta_j. \tag{39}$$

Since $\beta_j \ge 0$, we require for a solution that

$$\frac{\mathrm{Sgn}(1-a_j)}{\mathrm{Sgn}(-1-a_j)} < 0.$$

Thus we require that $|a_j|$ be less than one. Equation 39 can thus be written as

$$\left(\frac{1-a_j}{1+a_j}\right)^{P-1} = \beta_j; \quad |a_j| < 1. \tag{40}$$


For any set of gate functions, $\beta_j$ can be determined; $a_j$ can then be obtained by means of Eq. 40. If each gate function is chosen to be of infinitesimal width, then

$$\beta_j = \beta(y) = \frac{P_\eta(y+1)}{P_\eta(y-1)}, \tag{41}$$

in which $P_\eta(y)$ is the probability density of the amplitude of the noise. For such a case, $a_j = a(y)$ is the transfer characteristic of the no-memory system. Solving for $a(y)$ from Eq. 40, we then have

$$a(y) = \frac{1 - [\beta(y)]^{1/(P-1)}}{1 + [\beta(y)]^{1/(P-1)}}; \quad P > 1, \tag{42}$$

in which $\beta(y)$ is given by Eq. 41. We thus have obtained an explicit expression for the transfer characteristic of the no-memory system. The analogy between the approach taken in this example and the solution of a differential equation by means of difference equations is to be noted.
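As a numerical illustration of the closed-form characteristic, the sketch below evaluates $a(y)$ for Gaussian noise, assuming NumPy. It takes $\beta(y) = P_\eta(y+1)/P_\eta(y-1)$, which is the infinitesimal-width limit of the probability ratio described after Eq. 38; the noise standard deviation `sigma`, the function names, and the choice of a Gaussian density are assumptions made here and do not come from the report.

```python
import numpy as np

def transfer_characteristic(y, noise_pdf, p):
    """Evaluate a(y) = (1 - b) / (1 + b) with b = beta(y)^(1/(p-1)),
    where beta(y) = noise_pdf(y + 1) / noise_pdf(y - 1) and p > 1."""
    y = np.asarray(y, dtype=float)
    beta = noise_pdf(y + 1.0) / noise_pdf(y - 1.0)
    b = beta ** (1.0 / (p - 1.0))
    return (1.0 - b) / (1.0 + b)

sigma = 0.8  # assumed noise standard deviation (illustrative only)

def gaussian_pdf(y):
    """Zero-mean Gaussian probability density with standard deviation sigma."""
    return np.exp(-0.5 * (y / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

y = np.linspace(-3.0, 3.0, 13)
a_mse = transfer_characteristic(y, gaussian_pdf, p=2)  # mean-square criterion
```

Under these assumptions the P = 2 case reduces algebraically to $a(y) = \tanh(y/\sigma^2)$, since $\beta(y) = e^{-2y/\sigma^2}$ for a zero-mean Gaussian density. Larger values of P flatten the characteristic near the origin.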


5.  Probability  of  a  Function  of  the  Error 

Since an arbitrary function of the error can be written in the form of Eq. 28, we can also determine the coefficients to minimize Prob $\{F[E] > A\}$. We have from Eq. 28 the condition

$$\mathrm{Prob}\left\{\sum_{n=0}^{N} F[z-a_n]\, Q_n(x) > A\right\} = \text{minimum}. \tag{43}$$

But since, at any instant, only one term in the sum is nonzero, Eq. 43 can be written

$$\sum_{n=0}^{N} \mathrm{Prob}\left\{F[z-a_n]\, Q_n(x) > A\right\} = \text{minimum}. \tag{44}$$

However, the adjustment of any one coefficient affects only one term of the sum in Eq. 44. The minimum of Eq. 44 is thus obtained only if each term of the sum is a minimum. Thus the optimum value of $a_j$ is that for which

$$\mathrm{Prob}\left\{F[z-a_j]\, Q_j(x) > A\right\} = \text{minimum}. \tag{45}$$

A circuit by means of which the optimum value of $a_j$ can be determined is depicted in Fig. IX-8. The voltage, V, is adjusted to minimize the meter reading; V is then numerically equal to the desired value of $a_j$ and the meter reading is the probability that $F[z-a_j]\, Q_j(x) > A$.

Fig. IX-8. A circuit for determining $a_j$ that satisfies Eq. 44. (Legible block labels: SATURATING, RECTIFIER, AVERAGER, METER.)
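The per-gate search implied by Eq. 45 can be sketched in software as a scan over candidate values, assuming NumPy and sampled records. This is only an illustrative stand-in for the analog circuit of Fig. IX-8; the function name `min_probability_gain`, the candidate grid, and the use of a relative frequency as the probability estimate are assumptions made here, and the threshold A is taken to be positive.

```python
import numpy as np

def min_probability_gain(z, in_gate, f, threshold, candidates):
    """Return (a, prob): the candidate a that minimizes the fraction of
    samples for which F[z - a] * Q_j(x) exceeds `threshold` (Eq. 45),
    together with that fraction."""
    z = np.asarray(z, dtype=float)
    q = np.asarray(in_gate, dtype=float)  # Q_j(x) as 0.0 / 1.0
    probs = [float(np.mean(f(z - a) * q > threshold)) for a in candidates]
    best = int(np.argmin(probs))
    return float(candidates[best]), probs[best]
```

Since the criterion need not be convex in $a_j$, a plain scan of this kind avoids being trapped by local minima, at the cost of a resolution set by the candidate grid.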

6.  Weighted-Mean  Function  of  the  Error 

All  of  the  results  that  we  have  obtained  can  be  extended  to  weighted  functions  of  the 
error.  By  a  weighted  function  of  the  error  we  mean  W(t)  F[E(t)]  in  which  W(t)  is  an 
arbitrary  function  of  time.  For  example,  W(t)  can  be  a  function  of  [z(t)-x(t)].  By  use 
of Eq. 28, the weighted function of the error can be written

$$W F[E] = \sum_{n=0}^{N} F[z-a_n]\, W Q_n(x). \tag{46}$$

Thus all of our results apply if we replace $Q_n(x)$ by $W Q_n(x)$. Thus, to minimize a weighted mean of a convex function of the error, we have from Eq. 30 the result that the optimum value of $a_j$ is that for which

$$\overline{G[z-a_j]\, W Q_j(x)} = 0. \tag{47}$$

7.  Systems  with  Memory 

Fig. IX-9. Nonlinear system.

We shall now present an extension of our previous results to nonlinear systems with memory. A complete set of operators that could be used for the synthesis of nonlinear systems with memory is Wiener's G-functionals.4 These operators can be synthesized in the form shown schematically in Fig. IX-9. As shown, the nonlinear system can be divided into three sections that are connected in tandem: a linear system whose impulse responses, $h_n(t)$, form a complete set of functions; a nonlinear no-memory section that consists of just multipliers and adders; and a section consisting of amplifiers whose outputs are then summed. In this model of a nonlinear system, the first section is the only one that has memory, since its outputs, $v_n(t)$, are the result of linear operations on the past of the input, $x(t)$. We note from this model that the output, $y(t)$, of a nonlinear system of the Wiener class is just a linear combination of various products of the outputs, $v_n(t)$, of the linear section. If the linear section consists of only K linear systems, then, according to the model, the output, $y(t)$, can be expressed as


(48) 


By  use  of  the  gate  functions,  any  step  approximation  to  v^(t)  can  be  given  by 
N 

vi  ■  1  W 

n«0 


(49) 


By  substituting  Eq.  49  in  Eq.  48,  we  note  that  we  can  express  any  step  approximation 
to  y(t)  as 


•l  S-S 

k*l  nj»0  n^*C 


\ . nk<3n1<vl)  -  -^V- 


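(50)

We note that by use of Eq. 5 we can write

$$\sum_{n_{k+1}=0}^{N} \cdots \sum_{n_K=0}^{N} Q_{n_{k+1}}(v_{k+1}) \cdots Q_{n_K}(v_K) = 1. \tag{51}$$

Thus Eq. 50 can be written in the form

$$y = \sum_{n_1=0}^{N} \cdots \sum_{n_K=0}^{N} A_{n_1 \ldots n_K}\, Q_{n_1}(v_1) \cdots Q_{n_K}(v_K). \tag{52}$$

Define

$$\Phi_\alpha(v) = Q_{n_1}(v_1) \cdots Q_{n_K}(v_K), \tag{53}$$

in which $\alpha = (n_1, n_2, \ldots, n_K)$. Then Eq. 52 can be written in the form

$$y = \sum_{\alpha} A_\alpha\, \Phi_\alpha(v). \tag{54}$$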


We note that the functions, $\Phi_\alpha(v)$, are K-dimensional gate functions, since $\Phi_\alpha(v)$ is nonzero and equal to one only when the amplitude of $v_1(t)$ is in the interval of $Q_{n_1}(v_1)$, and the amplitude of $v_2(t)$ is in the interval of $Q_{n_2}(v_2)$, and so on for each $v_n(t)$. That is, $\Phi_\alpha(v)$ is nonzero and equal to one only when $v$ is in the $\alpha$th cell. We also note that

$$\sum_{\alpha} \Phi_\alpha(v) = 1. \tag{55}$$

By use of the K-dimensional gate functions, all of our results can be extended to nonlinear systems that have memory. For we note that if the desired output is $z(t)$, then any function of the error is

$$F[E] = F\left[z - \sum_{\alpha} A_\alpha\, \Phi_\alpha(v)\right]. \tag{56}$$

According to the definition of the K-dimensional gate functions, only one term in the sum is nonzero at any instant. If, at that instant, $v$ is in the $\alpha$th cell, then at that instant $F[E] = F(z-A_\alpha)\, \Phi_\alpha(v)$. Thus, by use of Eq. 55, we can express Eq. 56 as

$$F[E] = \sum_{\alpha} F[z-A_\alpha]\, \Phi_\alpha(v). \tag{57}$$

This equation is identical in form with Eq. 28 and thus all of the results that we have obtained for the one-dimensional case also apply to the K-dimensional case. For example, from Eq. 30 the condition that $A_\alpha$ be an extremal of $\overline{F(E)}$ is

$$\overline{G[z-A_\alpha]\, \Phi_\alpha(v)} = 0. \tag{58}$$

Experimentally, the values of $A_\alpha$ that satisfy Eq. 58 can be obtained by means of the circuit depicted in Fig. IX-10. The procedure is to adjust the battery voltage, V, until the meter reads zero; V is then numerically equal to the desired value of $A_\alpha$ for which Eq. 58 is satisfied. However, if $G$ can be expressed as a polynomial in its argument, then a polynomial expression in $A_\alpha$ can be obtained from Eq. 58. The coefficients of the various powers of $A_\alpha$ will involve only the averages $\overline{z^n \Phi_\alpha(v)}$, which can be measured experimentally. The values of $A_\alpha$ that satisfy Eq. 58 can then be determined by solving for the roots of the polynomial. For example, if $F(E) = E^2$, then Eq. 58 becomes

$$\overline{(z-A_\alpha)\, \Phi_\alpha(v)} = 0. \tag{59}$$

Solving for $A_\alpha$, we have

$$A_\alpha = \frac{\overline{z\, \Phi_\alpha(v)}}{\overline{\Phi_\alpha(v)}}. \tag{60}$$

Fig. IX-10. A circuit for determining $A_\alpha$ for which F[E] is a minimum.


Equation 60 is the result obtained by Bose.3 We thus have the important result that, as in the case of no-memory systems, no adjustment procedure of any sort is required in those cases in which $F(E)$ can be expressed as a polynomial. If $F(E)$ is not a convex function, the optimum value of $A_\alpha$ can still be determined analytically by solving for the minimum of the polynomial, $\overline{F[z-A_\alpha]\, \Phi_\alpha(v)}$, and thus no search procedure is required. Also, the set of coefficients $\{A_\alpha\}$ can be determined for which the mean of an arbitrary weighted function of the error or the probability of an arbitrary function of the error is a minimum. Each coefficient of this set can be determined individually in a manner similar to that used for the no-memory case.
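For the mean-square criterion, the cell-by-cell result (each coefficient is the average of the desired output over the times at which the linear-section outputs fall in that cell) can be sketched as follows, assuming NumPy and sampled records. The function name `cell_coefficients`, the use of the same gate boundaries for all K channels, and the NaN convention for unvisited cells are assumptions made here for illustration.

```python
import numpy as np

def cell_coefficients(v, z, thresholds):
    """Estimate A_alpha = mean(z * Phi_alpha) / mean(Phi_alpha) for every cell.

    v          : array (T, K) of linear-section outputs v_1 ... v_K
    z          : array (T,) of desired-output samples
    thresholds : increasing gate boundaries, shared by all K channels
    Returns an array of shape (N+1,)*K; cells that are never visited are NaN.
    """
    v = np.asarray(v, dtype=float)
    z = np.asarray(z, dtype=float)
    n_gates = len(thresholds) + 1
    k = v.shape[1]
    idx = np.searchsorted(thresholds, v, side="right")         # (T, K) gate indices
    flat = np.ravel_multi_index(tuple(idx.T), (n_gates,) * k)  # one cell id per sample
    counts = np.bincount(flat, minlength=n_gates ** k)
    sums = np.bincount(flat, weights=z, minlength=n_gates ** k)
    with np.errstate(invalid="ignore", divide="ignore"):
        coeffs = sums / counts                                 # 0/0 gives NaN
    return coeffs.reshape((n_gates,) * k)
```

The number of cells grows as $(N+1)^K$, so in practice the record must be long enough for every cell of interest to be visited.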

M. Schetzen

B. OPTIMUM QUANTIZATION FOR A GENERAL ERROR CRITERION

1. Introduction

Quantization is the nonlinear, no-memory operation of converting a continuous signal to a discrete signal that assumes only a finite number of levels (N). Quantization occurs whenever it is necessary to represent physical quantities numerically. The primary concern in quantization is faithful reproduction, with respect to some fidelity criterion, of the input at the output. Thus, it is not necessary to require uniform quantization. In fact, since the number of output levels, N, will be specified, the "error" in the output will be minimized by adjusting the quantizer characteristic. Figure IX-11 illustrates the input-output characteristic of a quantizer.

Early investigations into the process of quantization considered speech to be the
quantizer input signal. One of the first investigators was Bennett,¹ who concluded that,
with speech, it would be advantageous to taper the steps of the quantizer in such a
manner that finer steps would be available for weak signals. Following Bennett, other
investigators such as Smith,² Lozovoy,³ and Lloyd⁴ worked toward the characterization
of optimum quantizers by assuming that a large number of levels would be used in the
quantizer. Max⁵ formulated the general problem of selecting the parameters of the
optimum quantizer for a wide class of error criteria irrespective of the number of levels
in the device. He also was able to determine a partial solution to the problem by paying
particular attention to the mean-square error criterion. Bluestein⁶ derived some
extensions to Max's work for the special case of the mean-absolute error criterion.

In this report, the expression for the quantization error as a function of the quantizer
parameters is presented, and a method for determining these parameters by utilizing
the technique of dynamic programming is developed. This method has two advantages
over the previous ones. First, the method "searches" for the absolute minimum within
the region of variation rather than the relative minima. Second, the method is
guaranteed to converge to the absolute minimum in a specified number of steps.

Fig. IX-11. Input-output relationship of the N-level quantizer.

2.  Formulation  of  the  Problem 

Figure IX-11 shows the input-output relationship for the general quantizer. The output
is $y_k$ for $x_{k-1} \le x < x_k$. The $x_k$ are called the transition values; that is, $x_k$ is the
value of the input variable x for which there is a transition in the output from $y_k$ to
$y_{k+1}$. The $y_k$ are called the representation values.

In the general quantizer problem under consideration here, the $x_i$, as well as the
$y_k$ (a total of 2N-1 quantities), are variables. These 2N-1 variables are to be chosen
in such a manner that the error with respect to a specified error criterion is minimized.

In order to determine the specific values of $x_i$ and $y_k$ which specify the optimum
quantizer, that is, the quantities that we shall call $X_i$ and $Y_k$, we must first derive an
expression for the quantization error. Since by optimum we shall mean minimum
quantization error with respect to the specified error criterion, the $X_i$ and the
$Y_k$ are the specific values of $x_i$ and $y_k$ which yield the absolute minimum value for the
quantization error.

There is one set of constraints which must be imposed upon solutions to this problem.
For purposes of organization, as indicated in Fig. IX-11, we shall require that

$$x_1 < x_2, \quad x_2 < x_3, \quad \ldots, \quad x_{N-2} < x_{N-1}. \tag{1}$$

With respect to the quantizer of Fig. IX-11, the error signal when the input signal
satisfies the inequality

$$x_i \le x < x_{i+1} \tag{2}$$

is

$$x - y_{i+1}. \tag{3}$$


We shall choose the error criterion to be the expected value of the function
$g(x, y_{i+1})$. Then the measure of the error in this interval is

$$\int_{x_i}^{x_{i+1}} g(x, y_{i+1})\, p(x)\, dx, \tag{4}$$

where p(x) is the amplitude probability density of the input variable x. Since the error
in each of the quantization intervals is independent of the error in the other intervals,
the total quantization error is

$$\mathcal{E}(x_1, x_2, \ldots, x_{N-1};\; y_1, y_2, \ldots, y_N) = \sum_{i=0}^{N-1} \int_{x_i}^{x_{i+1}} g(x, y_{i+1})\, p(x)\, dx. \tag{5}$$


In writing this equation we included the two parameters $x_0$ and $x_N$ for convenience. If
x is a bounded function with lower bound $X_\ell$ and upper bound $X_u$, then, by definition,
$x_0$ will equal $X_\ell$ and $x_N$ will equal $X_u$.
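As an illustration of Eq. 5, the following minimal sketch evaluates the total quantization error of a given quantizer by the midpoint rule. The function name `quantization_error`, the number of integration steps, and the uniform-density example are assumptions made for illustration only; they are not part of the report.

```python
def quantization_error(transitions, levels, g, p, lo, hi, steps=2000):
    """Approximate Eq. 5 with the midpoint rule.

    transitions : x_1 .. x_{N-1}      (hypothetical argument names)
    levels      : y_1 .. y_N
    g(x, y)     : error function,  p(x) : amplitude probability density
    lo, hi      : X_l and X_u, the bounds of the input
    """
    edges = [lo] + list(transitions) + [hi]      # x_0 .. x_N
    total = 0.0
    for i, y in enumerate(levels):               # interval [x_i, x_{i+1}) maps to y_{i+1}
        a, b = edges[i], edges[i + 1]
        h = (b - a) / steps
        for k in range(steps):
            x = a + (k + 0.5) * h
            total += g(x, y) * p(x) * h
    return total


uniform = lambda x: 1.0                          # density on [0, 1]
squared = lambda x, y: (x - y) ** 2              # mean-square criterion

# Two-level uniform quantizer; the exact error is (0.5)**2 / 12 = 1/48.
err = quantization_error([0.5], [0.25, 0.75], squared, uniform, 0.0, 1.0)
print(err)
```

Any non-negative `g` and any density `p` can be substituted; only the numerical integration rule is fixed by this sketch.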

Upon first observation it appears that the coordinates of the absolute minimum value
of (5) can be determined by use of the methods of calculus. Indeed, if there is only a
single critical point of Eq. 5 within the region of variation (the region specified by Eq. 1)
and if this critical point is a relative minimum, then it is also the absolute minimum of
the function. However, if there is more than a single critical point within the region of
variation, then one of these critical points may be the absolute minimum of (5). Or it
might be that none of these critical points is the absolute minimum. Thus the method
of calculus is not a satisfactory technique for solving this problem. What is needed is
a method that yields the coordinates of the absolute minimum whether or not the
absolute minimum is at a critical point. The method of dynamic programming⁷,⁸ is such
a technique.


3.  Determining  the  Optimum  Quantizer  by  Using  Dynamic  Programming 

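In order to determine the coordinates of the absolute minimum value of (5) it is
necessary to define a sequence of error functionals

$$\{f_i(x_i)\}, \qquad i = 1, 2, \ldots, N,$$

as follows:

$$
\begin{aligned}
f_1(x_1) &= \min_{\substack{y_1 \\ X_\ell \le x_1 \le X_u}} \left[ \int_{X_\ell}^{x_1} g(x, y_1)\, p(x)\, dx \right] \\
f_2(x_2) &= \min_{\substack{y_2,\, x_1 \\ X_\ell \le x_1 \le x_2 \le X_u}} \left[ \int_{x_1}^{x_2} g(x, y_2)\, p(x)\, dx + f_1(x_1) \right] \\
f_3(x_3) &= \min_{\substack{y_3,\, x_2 \\ X_\ell \le x_2 \le x_3 \le X_u}} \left[ \int_{x_2}^{x_3} g(x, y_3)\, p(x)\, dx + f_2(x_2) \right] \\
&\;\;\vdots \\
f_i(x_i) &= \min_{\substack{y_i,\, x_{i-1} \\ X_\ell \le x_{i-1} \le x_i \le X_u}} \left[ \int_{x_{i-1}}^{x_i} g(x, y_i)\, p(x)\, dx + f_{i-1}(x_{i-1}) \right] \\
&\;\;\vdots \\
f_{N-1}(x_{N-1}) &= \min_{\substack{y_{N-1},\, x_{N-2} \\ X_\ell \le x_{N-2} \le x_{N-1} \le X_u}} \left[ \int_{x_{N-2}}^{x_{N-1}} g(x, y_{N-1})\, p(x)\, dx + f_{N-2}(x_{N-2}) \right] \\
f_N(x_N) &= \min_{\substack{y_N,\, x_{N-1} \\ X_\ell \le x_{N-1} \le x_N \le X_u}} \left[ \int_{x_{N-1}}^{x_N} g(x, y_N)\, p(x)\, dx + f_{N-1}(x_{N-1}) \right]
\end{aligned}
\tag{6}
$$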
It should be noted that in the last member of Eq. 6 we have taken the liberty of
permitting $x_N$ to vary from $X_\ell$ to $X_u$, notwithstanding our previous assumption
(see explanation immediately following Eq. 5) that $x_N = X_u$. This variation is necessary
if $f_N(x_N)$ is to be completely determined.

Also, from Eq. 6 we can show that the last member of this set of equations, when
evaluated at $x_N = X_u$, can be written

$$f_N(X_u) = \min_{\substack{y_i,\, x_{i-1};\; i = 1, 2, \ldots, N \\ X_\ell \le x_1 \le x_2 \le \cdots \le x_N = X_u}} \left[ \sum_{i=0}^{N-1} \int_{x_i}^{x_{i+1}} g(x, y_{i+1})\, p(x)\, dx \right]. \tag{7}$$

Therefore, $f_N(X_u)$ is the error for the optimum quantizer with N levels. (Note that,
in general, $f_i(X_u)$ is the error for the optimum quantizer with i levels, $i \le N$.)

It is possible to simplify the minimization process used to determine the error
functionals, Eq. 6, when we realize that the minimum, with respect to $y_i$, occurs for a
value of $y_i$ which satisfies the equation

$$0 = \int_{x_{i-1}}^{x_i} \frac{\partial}{\partial y_i} \left[ g(x, y_i) \right] p(x)\, dx. \tag{8}$$

If, as will usually be the case, $g(x, y_i)$ is greater than or equal to zero and is concave
upward, then (8) has only a single solution. This solution will be denoted by $\bar{y}_i$. (Should
(8) have more than one solution, the solution that minimizes (6) will be called $\bar{y}_i$.)
Equation 8 then allows us to rewrite the general term of (6) and to eliminate the formal
minimization with respect to $y_i$, since the value of $y_i$ which minimizes $f_i(x_i)$ for specific
values of $x_i$ and $x_{i-1}$ is $\bar{y}_i$. Thus (6) becomes

$$f_i(x_i) = \min_{\substack{x_{i-1} \\ X_\ell \le x_{i-1} \le x_i \le X_u}} \left[ \int_{x_{i-1}}^{x_i} g(x, \bar{y}_i)\, p(x)\, dx + f_{i-1}(x_{i-1}) \right], \qquad i = 1, 2, \ldots, N, \tag{9}$$

where $x_0 = X_\ell$, a constant, and $f_0(X_\ell) = 0$. With reference to Eq. 7 we see that (8) has
reduced the number of variables over which the formal minimization must be performed
from 2N-1 to N.
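Equation 8 can be solved numerically for one interval at a time. The sketch below does this by bisection on $y$; it assumes that the integral in Eq. 8 is increasing in $y$ and changes sign on the interval, which holds for a convex $g$ such as the squared error. The helper names `integral` and `best_level` are hypothetical. For the squared error the root is simply the centroid of p(x) on the interval.

```python
def integral(f, a, b, steps=400):
    """Midpoint-rule integral of f over [a, b]."""
    h = (b - a) / steps
    return sum(f(a + (k + 0.5) * h) for k in range(steps)) * h


def best_level(a, b, p, dg_dy, iters=60):
    """Solve Eq. 8 for y on the interval [a, b] by bisection.

    dg_dy(x, y) is the partial derivative of g with respect to y.
    Assumes the integral of dg_dy * p is increasing in y (convex g).
    """
    lo, hi = a, b
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if integral(lambda x: dg_dy(x, mid) * p(x), a, b) < 0.0:
            lo = mid          # derivative still negative: the root lies above mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Squared error g = (x - y)**2, so dg/dy = 2 (y - x).
dsq = lambda x, y: 2.0 * (y - x)
y_uniform = best_level(0.2, 0.6, lambda x: 1.0, dsq)       # centroid of a flat density
y_ramp = best_level(0.0, 1.0, lambda x: 2.0 * x, dsq)      # centroid of p(x) = 2x
print(y_uniform, y_ramp)
```

With the flat density the result is the midpoint of the interval, and with p(x) = 2x on [0, 1] it approaches 2/3, the conditional mean.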

In order to determine the optimum quantizer parameters it is necessary to define
two auxiliary sets of functionals: first, the transition-value decision functionals

$$\{X_i(x)\}, \qquad i = 1, 2, \ldots, N;$$

and second, the representation-value decision functionals

$$\{Y_i(x)\}, \qquad i = 1, 2, \ldots, N.$$

The transition-value decision functionals are defined as follows:

$X_1(x) = X_\ell$;
$X_2(x)$ = the value of $x_1$ which minimizes $f_2(x_2)$ for a specified $x = x_2$, $y_2 = \bar{y}_2$;
...
$X_N(x)$ = the value of $x_{N-1}$ which minimizes $f_N(x_N)$ for a specified $x = x_N$, $y_N = \bar{y}_N$.    (10)

In a similar manner the representation-value decision functionals are defined as

$Y_1(x) = \bar{y}_1$ for a specified $x = x_1$;
$Y_2(x) = \bar{y}_2$ when $x_1$ is that value [i.e., $x_1 = X_2(x)$] which minimizes $f_2(x_2)$ for a specified $x = x_2$;
...
$Y_N(x) = \bar{y}_N$ when $x_{N-1}$ is that value [i.e., $x_{N-1} = X_N(x)$] which minimizes $f_N(x_N)$ for a specified $x = x_N$.    (11)

Each of the functionals, Eqs. 6, 10, and 11, will be represented by a tabulation of its
value at a number of points taken along an equispaced grid. In general, the same set
of points will be used for each of the functionals.

QPR No. 69 140
(IX. STATISTICAL COMMUNICATION THEORY)

As we have previously indicated, $f_N(X_u)$ is the measure of error for the optimum
quantizer with N levels with the specified error criterion used. Therefore, the location
of the transition value between levels (N-1) and (N) is specified by (10), that is,

$$X_{N-1} = X_N(X_u).$$

This level will be represented by the representation value

$$Y_N = Y_N(X_u),$$

specified by Eq. 11. At this point in the decision process, x that is such that
$X_{N-1} \le x \le X_u$ has been allocated to the N-th level. The remaining values of x,
$X_\ell \le x < X_{N-1}$, remain to be quantized into N-1 levels. From (10) the transition
value between levels (N-2) and (N-1) is given by

$$X_{N-2} = X_{N-1}(X_{N-1}),$$

and from (11) the representation value is

$$Y_{N-1} = Y_{N-1}(X_{N-1}).$$

This decision process is continued until the first representation value is found from (11)
to be

$$Y_1 = Y_1(X_1).$$

Once the decision process is completed, the optimum quantizer with respect to the
specified error criterion is determined.
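The forward tabulation and the backward decision process can be sketched on an equispaced grid as follows. This is a minimal illustration under stated assumptions, not the report's program: it is restricted to the squared-error criterion (so that the best representation value of an interval is its centroid and the interval cost follows from prefix sums), it requires each interval to contain at least one grid cell, and the names `optimum_quantizer`, `cost`, `f`, `X` and `Y` are hypothetical.

```python
def optimum_quantizer(p, lo, hi, n_levels, grid=200):
    """Tabulate f_i, X_i, Y_i on a grid, then trace back the optimum quantizer.

    Returns (minimum error, transition values, representation values).
    Squared-error criterion only (an assumption of this sketch).
    """
    h = (hi - lo) / grid
    s0, s1, s2 = [0.0], [0.0], [0.0]              # prefix moments of p over grid cells
    for k in range(grid):
        x = lo + (k + 0.5) * h
        w = p(x) * h
        s0.append(s0[-1] + w)
        s1.append(s1[-1] + w * x)
        s2.append(s2[-1] + w * x * x)

    def cost(a, b):
        """Error of cells a..b-1 about their centroid, and that centroid."""
        m0 = s0[b] - s0[a]
        if m0 <= 0.0:                             # no probability mass in the interval
            return 0.0, lo + 0.5 * (a + b) * h
        m1 = s1[b] - s1[a]
        y = m1 / m0
        return max(s2[b] - s2[a] - m1 * y, 0.0), y

    inf = float("inf")
    f = [[inf] * (grid + 1) for _ in range(n_levels + 1)]
    X = [[0] * (grid + 1) for _ in range(n_levels + 1)]
    Y = [[0.0] * (grid + 1) for _ in range(n_levels + 1)]
    f[0][0] = 0.0                                 # f_0(X_l) = 0, with x_0 = X_l
    for i in range(1, n_levels + 1):
        for j in range(i, grid + 1):              # candidate x_i
            for k in range(i - 1, j):             # candidate x_{i-1}
                if f[i - 1][k] == inf:
                    continue
                c, y = cost(k, j)
                if f[i - 1][k] + c < f[i][j]:
                    f[i][j], X[i][j], Y[i][j] = f[i - 1][k] + c, k, y

    # Decision process: start from x_N = X_u and work back to the first level.
    j, transitions, levels = grid, [], []
    for i in range(n_levels, 0, -1):
        levels.append(Y[i][j])
        j = X[i][j]
        if i > 1:
            transitions.append(lo + j * h)
    return f[n_levels][grid], transitions[::-1], levels[::-1]


error, xs, ys = optimum_quantizer(lambda x: 1.0, 0.0, 1.0, 4)
print(error, xs, ys)
```

For a flat density on [0, 1] and four levels the sketch recovers the uniform quantizer, as expected; the tabulated error differs slightly from the continuous value 1/192 because the density is lumped at the cell midpoints.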

As a first example, this method of selecting the optimum quantizer is being applied
in the case in which speech is the input signal. In this application the error criterion
will be the expected value of the function

$$g(x, y_{i+1}) = |x - y_{i+1}|^{\nu}\, W(x),$$

where

$$W(x) > 0, \qquad -\infty < x < \infty.$$


J.  D.  Bruce 
A. TWO-DIMENSIONAL POWER DENSITY SPECTRUM OF TELEVISION RANDOM NOISE

1. Introduction

Several workers¹⁻⁴ have investigated the subjective effect of television random noise.
They have tried to relate the subjective effect of additive Gaussian noise to the
one-dimensional power density spectrum of the noise considered as a function of time.
Although the noise is one-dimensionally generated as a function of time, it is
nevertheless displayed in two-dimensional fashion. Therefore, it has been our opinion that it
might be more fruitful to try to relate the subjective effect of additive Gaussian noise
to the two-dimensional power density spectrum of the noise considered as a function of
two space variables. In this connection the following question naturally arises. A
one-dimensional Gaussian noise with known power density spectrum $\Phi_0(\omega)$ is given. From
this noise a two-dimensional noise is obtained by scanning. What is the power density
spectrum $\Phi^*(u,v)$ of the two-dimensional noise in terms of $\Phi_0(\omega)$? In this report we
shall attempt to answer this question.

2.  Two-Dimensional  Power  Density  Spectrum 

We restrict our attention to black-and-white still pictures. The two-dimensional
noise is completely specified by its magnitude $n^*(x,y)$ as a function of the space variables
x and y. The geometry of the scanning lines is shown in Fig. X-1. We assume that the
scanning is from left to right and from top to bottom, and that there is no interlacing.

Laboratory, a center for research operated by Massachusetts Institute of Technology,
AF19(604)-7400; and in part by the National Institutes of Health (Grant MH-04737-02),
and the National Science Foundation (Grant G-16526).

[Figure X-1: x and y axes with origin (0,0), scan lines of length L, and the displaced point (x+τ₁, y+τ₂).]

Fig. X-1. Geometry of scanning lines.

The length of each scan line is L, and the distance between two successive scan lines
is taken as the unit length. Let the one-dimensional noise be $n_0(t)$, and let (x,y) = (0,0)
correspond to t = 0. Then we have

$$n^*(x,y) = n(x,y) \sum_k \delta(y-k), \tag{1}$$

where

$$n(x,y) = n_0(x + yL) \tag{2}$$

and

$$k = 0, \pm 1, \pm 2, \ldots; \qquad -\infty < x < \infty, \quad -\infty < y < \infty.$$
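The scanning relation of Eq. 2 can be illustrated on an integer grid. The sketch below assumes that the one-dimensional noise is available as a list of samples taken at unit spacing along a line, so that one scan line holds exactly L samples; that sampling, and the function name `scan_to_picture`, are assumptions made for illustration.

```python
def scan_to_picture(n0, line_length, n_lines):
    """Arrange 1-D samples into a picture: picture[y][x] = n0[x + y * L] (Eq. 2)."""
    L = line_length
    return [[n0[x + y * L] for x in range(L)] for y in range(n_lines)]


n0 = list(range(12))                  # stand-in for 12 noise samples
picture = scan_to_picture(n0, 4, 3)   # L = 4 samples per line, 3 lines
print(picture)
# Horizontal neighbours are 1 sample apart in time, vertical neighbours L samples apart.
print(picture[1][2] - picture[1][1], picture[2][1] - picture[1][1])
```

The second print shows the property that drives the rest of the analysis: a unit step in y corresponds to a step of L in the time variable.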

Any particular noise picture can be considered as a finite piece of the sample
function (1), which is infinite in extent. The impulses $\delta(y-k)$ are used so that we can
deal with the Fourier transform of $n^*$ instead of the Z-transform.

We assume that the one-dimensional noise is ergodic; then n(x,y) is also ergodic.
The autocorrelation function of n(x,y) is

$$
\begin{aligned}
\phi(\tau_1, \tau_2) &= \overline{n(x,y)\, n(x+\tau_1,\, y+\tau_2)} \\
&= \overline{n_0(t)\, n_0(t + \tau_1 + \tau_2 L)} \\
&= \phi_0(\tau_1 + \tau_2 L),
\end{aligned}
\tag{3}
$$

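where $\phi_0(\tau)$ is the autocorrelation function of $n_0(t)$. Let $\Phi(u,v)$ and $\Phi_0(\omega)$ be the power
density spectra of n(x,y) and $n_0(t)$, respectively. We have

$$
\begin{aligned}
\Phi(u,v) &= \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \phi(\tau_1, \tau_2)\, e^{-j(\tau_1 u + \tau_2 v)}\, d\tau_1\, d\tau_2 \\
&= \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \phi_0(\tau_1 + \tau_2 L)\, e^{-j(\tau_1 u + \tau_2 v)}\, d\tau_1\, d\tau_2.
\end{aligned}
\tag{4}
$$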
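Carrying the integral of Eq. 4 one step further shows why the two-dimensional spectrum collapses onto a line. The following worked step is a sketch under an assumed transform convention, $\Phi_0(\omega) = \int \phi_0(t)\, e^{-j\omega t}\, dt$ (the sign of the exponent and the placement of the factor $2\pi$ are assumptions, not taken from the report), with the substitution $s = \tau_1 + \tau_2 L$:

```latex
\begin{aligned}
\Phi(u,v)
  &= \int_{-\infty}^{\infty}\!\int_{-\infty}^{\infty}
     \phi_0(\tau_1+\tau_2 L)\, e^{-j(\tau_1 u+\tau_2 v)}\, d\tau_1\, d\tau_2 \\
  &= \int_{-\infty}^{\infty} \phi_0(s)\, e^{-jsu}\, ds
     \int_{-\infty}^{\infty} e^{-j\tau_2 (v-Lu)}\, d\tau_2
     \qquad (s=\tau_1+\tau_2 L) \\
  &= 2\pi\, \Phi_0(u)\, \delta(v-Lu).
\end{aligned}
```

Under this convention the spectrum is an impulse sheet of weight proportional to $\Phi_0(u)$ on the line $v = Lu$, that is, $u - v/L = 0$.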


Figure X-2 shows a possible $\Phi_0(\omega)$ and its corresponding $\Phi(u,v)$. It is to be noted
that $\Phi(u,v)$ is zero everywhere except on the line $u - v/L = 0$, where it is an impulse sheet.

[Figure X-2: sketches of $\Phi_0(\omega)$ and of $\Phi(u,v)$.]

Fig. X-2. Relation between $\Phi(u,v)$ and $\Phi_0(\omega)$.

For ordinary commercial television, we have $L \approx 500$. Hence the line $u - v/L = 0$ is very
close to the v-axis.

Letting $\Phi^*(u,v)$ be the power density spectrum of $n^*(x,y)$, we have

$$\Phi^*(u,v) = \sum_k \Phi(u,\, v + 2\pi k), \tag{5}$$

where the summation is over all integers. $\Phi^*(u,v)$ consists of identical impulse sheets
on the lines $u - v/L = 2\pi k / L$, $k = 0, \pm 1, \pm 2, \ldots$.

3. One-Dimensional Power Density Spectrum along a Particular Direction

We have derived the power density spectrum $\Phi^*(u,v)$ of a two-dimensional noise in
terms of the one-dimensional power density spectrum along the x-direction. It is
reasonable to believe that the subjective effect of noise depends also on the one-dimensional
power density spectra along directions other than the x-axis.

Fig. X-3. Calculation of the one-dimensional power density spectrum along the a direction.


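We shall find the one-dimensional power density spectrum $\Phi_a(\omega)$ along a line of
slope a (Fig. X-3). Let $n_a^*(z)$ be the noise along such a line:

$$n_a^*(z) = \ldots, \qquad k = 0, \pm 1, \pm 2, \ldots, \tag{6}$$

where

$$n_a(z) = n(z \cos\theta,\; z \sin\theta + b) \quad \text{for some fixed } b. \tag{7}$$

The autocorrelation function of $n_a(z)$ is

$$
\begin{aligned}
\phi_a(\tau) &= \overline{n_a(z)\, n_a(z+\tau)} \\
&= \overline{n(x,y)\, n(x + \tau\cos\theta,\; y + \tau\sin\theta)} \\
&= \phi(\tau\cos\theta,\; \tau\sin\theta).
\end{aligned}
\tag{8}
$$

Let

$$\phi_a'(\tau_1) = \phi_a\!\left(\frac{\tau_1}{\cos\theta}\right) = \phi(\tau_1,\; a\tau_1). \tag{9}$$

The Fourier transform of $\phi_a'(\tau_1)$ is

$$\Phi_a'(\omega) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \Phi(\omega - av,\; v)\, dv. \tag{10}$$

Hence the Fourier transform of $\phi_a(\tau)$ is

$$\Phi_a(\omega) = \frac{1}{2\pi\, |\cos\theta|} \int_{-\infty}^{\infty} \Phi\!\left(\frac{\omega}{\cos\theta} - av,\; v\right) dv. \tag{11}$$

Putting Eq. 4 into Eq. 11, we have

$$\Phi_a(\omega) = \frac{1}{|(1+La)\cos\theta|}\; \Phi_0\!\left[\frac{\omega}{(1+La)\cos\theta}\right]. \tag{12}$$

In particular, for a = 0, we have

$$\Phi_a(\omega)\big|_{a=0} = \Phi_0(\omega),$$

which checks with our assumption. For $a \to \infty$, we have

$$\Phi_\infty(\omega) = \frac{1}{L}\, \Phi_0\!\left(\frac{\omega}{L}\right). \tag{13}$$

We note that for $L \approx 500$, the bandwidth of $\Phi_\infty(\omega)$ is 500 times that of $\Phi_0(\omega)$. Figure X-4
shows how the factor $\sqrt{1+a^2}\,/\,|1+La|$ varies with the slope a. We note that the area under
$\Phi_a(\omega)$ is independent of a.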
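The frequency scaling in Eq. 12 is easy to tabulate. The sketch below assumes that the slope and the angle are related by a = tan θ, so that (1 + La) cos θ = (1 + La)/√(1 + a²); the function names are hypothetical, and the Gaussian-shaped spectrum is only a stand-in for $\Phi_0(\omega)$.

```python
import math


def spectrum_scale(a, L):
    """(1 + L a) cos(theta) for slope a = tan(theta): the stretch of the frequency axis."""
    return (1.0 + L * a) / math.sqrt(1.0 + a * a)


def directional_spectrum(phi0, omega, a, L):
    """Eq. 12: spectrum along a line of slope a, given the 1-D spectrum phi0."""
    s = spectrum_scale(a, L)
    return phi0(omega / s) / abs(s)


phi0 = lambda w: math.exp(-0.5 * w * w)      # illustrative spectrum shape only

for slope in (0.0, 0.01, 0.1, 1.0, 1e9):
    print(slope, spectrum_scale(slope, 500.0))
```

The scale is 1 along the x-axis and approaches L for a vertical line, which is the 500-fold widening of bandwidth noted in the text; dividing the amplitude by the same factor keeps the area under the spectrum unchanged.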
Fig. X-4. $\sqrt{1+a^2}\,/\,|1+La|$ vs a.

Finally, from Eq. 6, we find that the one-dimensional power density spectrum
along the direction a is

$$\ldots \tag{14}$$

where $k = 0, \pm 1, \pm 2, \ldots$.


4. Discussion

From subjective tests,¹⁻⁴ it has been found that for pictures that are more or less
isotropic, low-frequency noise (low-frequency when considered as a time function) is
in general more annoying. From Eq. 12 we know, however, that a two-dimensional
noise obtained from a low-frequency one-dimensional noise by scanning may contain
high frequencies along directions other than the x-axis. In particular, the bandwidth
along the y-axis is approximately 500 times as wide as that along the x-axis. Since the
spatial frequency response of the human eye has a high-frequency cutoff, we suspect
that the following hypothesis might be true for isotropic pictures contaminated by
additive Gaussian noise: the more anisotropic a noise is, the more annoying it will be.
Work is being carried out to test, among other things, this hypothesis, but the results
are still not conclusive.

It is important to note that the mathematical model from which we obtained
the power density spectra is quite crude. In order to relate the spectra to
the subjective effect of noise, modifications may have to be made to take care
of the finiteness of the scanning aperture.

We should like to thank W. L. Black, who offered many very helpful comments
on this work.

T. S. Huang
B. SEQUENTIAL DECODING FOR AN ERASURE CHANNEL WITH MEMORY

1. Introduction

It has been shown that sequential decoding is a computationally efficient technique
for decoding with high reliability on memoryless (or constant) channels.¹ It is of
interest to determine whether or not similar statements are true for channels with
memory.

In order to gain insight into the problem of sequential decoding with memory, a
simple channel model, a two-state Markov erasure channel, has been analyzed. The
presence of memory is shown to increase the average number of computations above
that required for a memoryless erasure channel with the same capacity. For the
erasure channel, the effects of memory can be reduced by scrambling.

2.  Channel  Model 

Assume as a channel model an erasure channel with an erasure pattern that is
generated by a Markov process. Assume the process shown in Fig. X-5. Assume for
simplicity that the process begins in the 0 state. This will not affect the character of
the results.

Fig. X-5. Noise model.

It can be shown that the probability of being in the 0 state after many transitions
becomes independent of the starting probabilities and approaches $p_1/(p_0+p_1)$. Channel
capacity is defined as

$$C = \lim_{n \to \infty} \max_{P(x^n)} \frac{1}{n}\, I(x^n; y^n),$$

where $x^n$ and $y^n$ are input and output sequences of n digits, respectively. Then,
intuitively, $C = p_1/(p_0+p_1)$, since information is transmitted only when the noise process is in
the 0 state and the frequency of this event approaches $n\, p_1/(p_0+p_1)$ with large n.
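The limiting probability of the 0 state can be checked by iterating the chain. The sketch below assumes that $p_0$ is the probability of a transition from state 0 (no erasure) to state 1 (erasure) and $p_1$ the probability of the reverse transition; that reading of the symbols, and the function name, are assumptions made for illustration.

```python
def stationary_zero_state(p0, p1, steps=10_000, start=1.0):
    """Probability of being in state 0 after many transitions.

    p0 = Pr(0 -> 1), p1 = Pr(1 -> 0)   (assumed meaning of the symbols)
    start = probability of beginning in state 0
    """
    pi0 = start
    for _ in range(steps):
        pi0 = pi0 * (1.0 - p0) + (1.0 - pi0) * p1
    return pi0


from_zero = stationary_zero_state(0.1, 0.3, start=1.0)
from_one = stationary_zero_state(0.1, 0.3, start=0.0)
print(from_zero, from_one, 0.3 / (0.1 + 0.3))
```

Both starting conditions lead to the same value, p1/(p0 + p1), which under the intuitive argument above is also the capacity in bits per digit.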


3. Decoding Algorithm

We encode for this channel in the following fashion: A stream of binary digits is
supplied by the source. Each digit selects ℓ digits from a preassigned tree (see
Fig. X-6). If a digit has the value 1, the upper link at a node is chosen, the lower link
being chosen when it is 0. (In our example (1, 0, 0) produces (011, 101, 000).)

Our object in decoding will be to determine the first information digit of a sequence
of n digits. Having done this, we move to the next node and repeat the decoding process,
again using n digits. The following decoding algorithm is used: Build two identical
decoders that operate separately on each subset of the tree in time synchronism. Each
decoder compares the received sequence with paths in its subset, discarding a path
(and all longer paths with this as a prefix) as soon as a digit disagrees with an unerased
received digit. When either a subset is identified (one subset is eliminated) or an
ambiguity occurs (that is, when an acceptable path is found in each subset), we stop.
If an ambiguity occurs, we ask for a repeat. The computation, then, is within a factor of
2 of that necessary to decode the incorrect subset. We average the computation over
the ensemble of all erasure patterns and all incorrect subsets.

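4. Computation

Let $x_i$ equal the number of nodes that are acceptable at the i-th state of the incorrect
subset. We trace two links for each node, or perform two computations. Then, the
average computation to decode the incorrect subset is I(n):

$$I(n) = 2 \sum_{i=0}^{nR-1} \bar{x}_i,$$

where $R = 1/\ell$, the rate of the source. We have

$$\bar{x}_i = 2^{i}\, \Pr(\text{a path of } i\ell \text{ digits is acceptable}).$$

Now

$$\Pr(\text{a path of } i\ell \text{ digits is acceptable}) = \sum_{r=0}^{i\ell} \left(\tfrac{1}{2}\right)^{i\ell - r} P_{i\ell}(r),$$

where $P_m(r)$ is the probability of r erasures in m transitions. Then

$$I(n) = 2 \sum_{i=0}^{nR-1} \left(\tfrac{1}{2}\right)^{i(\ell-1)} \sum_{r=0}^{i\ell} 2^{r}\, P_{i\ell}(r).$$

We recognize that the sum on r is the moment-generating function of erasures, $g_m(s)$,
evaluated at s = ln 2:

$$g_m(s) = \sum_{e^m} e^{s\,\phi(e^m)}\, P(e^m),$$

where $e^m$ is a sequence of m noise states and

$$\phi(e^m) = \sum_{i=1}^{m} \phi(e_i), \qquad \phi(x) = \begin{cases} 0, & x = 0 \\ 1, & x = 1. \end{cases}$$

It can be shown that

$$g_m(s) = \begin{bmatrix} 1 & 0 \end{bmatrix} \Pi^m \begin{bmatrix} 1 \\ 1 \end{bmatrix},$$

where

$$\Pi = \begin{bmatrix} q_0 & p_0 e^{s} \\ p_1 & q_1 e^{s} \end{bmatrix},$$

and that $\Pi^m$ is asymptotically equal to the m-th power of the largest eigenvalue of $\Pi$.
Then, we find the following asymptotically correct expression for I(n):

$$I(n) \sim \begin{cases} w_0, & R < R_{\text{comp}} \\ w_1\, 2^{\,n(R - R_{\text{comp}})}, & R > R_{\text{comp}}, \end{cases}$$

where $w_0$, $w_1$ are constants and

$$R_{\text{comp}} = 1 - \log_2 \left[\, \ldots \,\right].$$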


Using similar techniques, we can find an upper bound to the probability of ambiguity
and show that it is equal to the random block-coding bound and that the zero-rate
exponent of this bound is $R_{\text{comp}}$.

The computational cutoff rate, $R_{\text{comp}}$, is shown as a function of $p_0$ for fixed capacity,
c, in Fig. X-7. $R_{\text{comp}}$ is an increasing function of $p_0$ for constant c. It is equal to
$R^0_{\text{comp}}$, the memoryless rate, when $p_0 = q_1 = 1 - c$, and it exceeds $R^0_{\text{comp}}$ when $p_0 > q_1$.
However, $p_0 > q_1$ is not, in general, physically meaningful, since this situation
corresponds to anticlustering.
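The dependence of the cutoff rate on $p_0$ at fixed capacity can be explored numerically. The sketch below rests on an assumption drawn from the asymptotic argument above rather than on the report's closed-form expression: it takes $R_{\text{comp}} = 1 - \log_2 \lambda$, where $\lambda$ is the largest eigenvalue of the 2×2 matrix with rows $(q_0,\ p_0 e^{s})$ and $(p_1,\ q_1 e^{s})$ at s = ln 2, with $q_0 = 1 - p_0$ and $q_1 = 1 - p_1$. The function names are hypothetical.

```python
import math


def largest_eigenvalue(q0, p0, p1, q1, s):
    """Largest eigenvalue of [[q0, p0 e^s], [p1, q1 e^s]] from the quadratic formula."""
    a, b = q0, p0 * math.exp(s)
    c, d = p1, q1 * math.exp(s)
    trace, det = a + d, a * d - b * c
    return 0.5 * (trace + math.sqrt(trace * trace - 4.0 * det))


def r_comp(p0, p1):
    """Assumed form: R_comp = 1 - log2(largest eigenvalue at s = ln 2)."""
    lam = largest_eigenvalue(1.0 - p0, p0, p1, 1.0 - p1, math.log(2.0))
    return 1.0 - math.log2(lam)


# Capacity c = p1 / (p0 + p1) = 0.75 in every case, so p1 = 3 * p0.
for p0 in (0.01, 0.05, 0.15, 0.25):
    print(p0, r_comp(p0, 3.0 * p0))
```

At $p_0 = q_1 = 1 - c = 0.25$ the chain has no memory and the sketch gives $1 - \log_2 1.25$, the familiar cutoff rate of a memoryless erasure channel with erasure probability 0.25; smaller $p_0$ (a "stickier" channel) gives a smaller value, in line with the trend described for Fig. X-7.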

5. Conclusions

We conclude that sequential decoding is inefficient when the channel becomes "sticky"
(small $p_0$). It is possible, however, to reduce the effective memory of the channel and
increase $R_{\text{comp}}$ if we scramble before transmission and unscramble after reception.
Since the ergodic probability, $p_0/(p_0+p_1)$, of erasures is not changed by this maneuver,
channel capacity remains constant.

Most of these results appear to be extendable to binary-input, binary-output channels.
It is not clear, however, that scrambling will improve performance on binary channels,
since capacity is reduced in the process. Work is continuing on this problem.

J. E. Savage
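C. A SIMPLE DERIVATION OF THE CODING THEOREM

This report will summarize briefly a new derivation of the Coding Theorem. Let
$\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_M$ be a set of M code words for use on a noisy channel, and let
$\mathbf{y}_1, \mathbf{y}_2, \ldots, \mathbf{y}_L$ be the set of possible received sequences at the channel output. The
channel is defined by a set of transition probabilities on these sequences, $\Pr(\mathbf{y} \mid \mathbf{x}_m)$.
Define

$$\Pr(\mathbf{y}) = \sum_m \Pr(\mathbf{x}_m)\, \Pr(\mathbf{y} \mid \mathbf{x}_m) \tag{1}$$

$$\Pr(\mathbf{x}_m \mid \mathbf{y}) = \frac{\Pr(\mathbf{y} \mid \mathbf{x}_m)\, \Pr(\mathbf{x}_m)}{\Pr(\mathbf{y})}. \tag{2}$$

For a given received $\mathbf{y}$, the decoder will select the number m for which $\Pr(\mathbf{x}_m \mid \mathbf{y})$
is a maximum. We call this number $m_y$. The probability that the decoder selects
correctly in this case is, then, $\Pr(\mathbf{x}_{m_y} \mid \mathbf{y})$. Therefore the probability of decoding error
is

$$P_e = \sum_{\mathbf{y}} \Pr(\mathbf{y}) \left[ 1 - \Pr(\mathbf{x}_{m_y} \mid \mathbf{y}) \right]. \tag{3}$$

We now upper-bound the expression in brackets in Eq. 3:

$$1 - \Pr(\mathbf{x}_{m_y} \mid \mathbf{y}) = \sum_{m \ne m_y} \Pr(\mathbf{x}_m \mid \mathbf{y}) \tag{4}$$

$$\le \left[ \sum_{m \ne m_y} \Pr(\mathbf{x}_m \mid \mathbf{y})^{1/(1+\rho)} \right]^{1+\rho} \qquad \text{for any } \rho > 0. \tag{5}$$

Equation 5 follows from the fact that $t^{1/(1+\rho)}$ is a convex upward function of t for $\rho > 0$.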

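Rewriting Eq. 5, we obtain

$$1 - \Pr(\mathbf{x}_{m_y} \mid \mathbf{y}) \le \left[ \sum_{m \ne m_y} \Pr(\mathbf{x}_m \mid \mathbf{y})^{1/(1+\rho)} \right] \left[ \sum_{m' \ne m_y} \Pr(\mathbf{x}_{m'} \mid \mathbf{y})^{1/(1+\rho)} \right]^{\rho} \tag{6}$$

$$\le \sum_m \Pr(\mathbf{x}_m \mid \mathbf{y})^{1/(1+\rho)} \left[ \sum_{m' \ne m} \Pr(\mathbf{x}_{m'} \mid \mathbf{y})^{1/(1+\rho)} \right]^{\rho}. \tag{7}$$

Equation 7 follows from overbounding the first term of Eq. 6 by summing over all m,
and overbounding the second term by replacing the missing term, $\Pr(\mathbf{x}_{m_y} \mid \mathbf{y})$, with
a smaller missing term, $\Pr(\mathbf{x}_m \mid \mathbf{y})$. Now assume that the code words are equally
likely, so that $\Pr(\mathbf{x}_m) = 1/M$ for all m, and substitute Eq. 2 in Eq. 7.

$$1 - \Pr(\mathbf{x}_{m_y} \mid \mathbf{y}) \le \frac{1}{M \Pr(\mathbf{y})} \sum_m \Pr(\mathbf{y} \mid \mathbf{x}_m)^{1/(1+\rho)} \left[ \sum_{m' \ne m} \Pr(\mathbf{y} \mid \mathbf{x}_{m'})^{1/(1+\rho)} \right]^{\rho} \tag{8}$$

$$P_e \le \frac{1}{M} \sum_{\mathbf{y}} \sum_m \Pr(\mathbf{y} \mid \mathbf{x}_m)^{1/(1+\rho)} \left[ \sum_{m' \ne m} \Pr(\mathbf{y} \mid \mathbf{x}_{m'})^{1/(1+\rho)} \right]^{\rho}. \tag{9}$$

Equation 9 bounds $P_e$ for a particular code in terms of an arbitrary parameter $\rho > 0$.
We shall now average this result over an ensemble of codes. For each m, let $\mathbf{x}_m$ be
chosen independently according to some probability measure $P(\mathbf{x})$.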

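$$\bar{P}_e \le \frac{1}{M} \sum_{\mathbf{y}} \sum_m \sum_{\mathbf{x}_m} P(\mathbf{x}_m)\, \Pr(\mathbf{y} \mid \mathbf{x}_m)^{1/(1+\rho)}\; \overline{\left[ \sum_{m' \ne m} \Pr(\mathbf{y} \mid \mathbf{x}_{m'})^{1/(1+\rho)} \right]^{\rho}}. \tag{10}$$

The bar over the last term in Eq. 10 refers to the average over the ensemble of all code
words other than m. Now let $\rho \le 1$. Then we are averaging over a convex upward
function, and we can upper-bound Eq. 10 by averaging before raising to the power $\rho$. This
is, then, the average of a sum of M-1 identically distributed random variables. Thus,
for $0 < \rho \le 1$,

$$\bar{P}_e \le \frac{1}{M} \sum_{\mathbf{y}} \sum_m \sum_{\mathbf{x}_m} P(\mathbf{x}_m)\, \Pr(\mathbf{y} \mid \mathbf{x}_m)^{1/(1+\rho)} \left[ (M-1) \sum_{\mathbf{x}} P(\mathbf{x})\, \Pr(\mathbf{y} \mid \mathbf{x})^{1/(1+\rho)} \right]^{\rho}. \tag{11}$$

Removing the index m in the sum over $\mathbf{x}_m$, and summing over m, we have

$$\bar{P}_e \le (M-1)^{\rho} \sum_{\mathbf{y}} \left[ \sum_{\mathbf{x}} P(\mathbf{x})\, \Pr(\mathbf{y} \mid \mathbf{x})^{1/(1+\rho)} \right]^{1+\rho}. \tag{12}$$

The bound in Eq. 12 applies to any channel for which $\Pr(\mathbf{y} \mid \mathbf{x})$ can be defined, and
is valid for all choices of $P(\mathbf{x})$ and all $\rho$, $0 < \rho \le 1$. If the channel is memoryless (that
is, if $\Pr(\mathbf{y} \mid \mathbf{x}) = \prod_{n=1}^{N} \Pr(y_n \mid x_n)$, where $\mathbf{y} = (y_1, \ldots, y_n, \ldots, y_N)$ and
$\mathbf{x} = (x_1, \ldots, x_n, \ldots, x_N)$), then the bound can be further simplified. Let $P(\mathbf{x})$ be a
probability measure that chooses each letter independently with the same probability (that is,
$P(\mathbf{x}) = \prod_{n=1}^{N} P(x_n)$). Then

$$\bar{P}_e \le (M-1)^{\rho} \sum_{\mathbf{y}} \left[ \sum_{\mathbf{x}} \prod_{n=1}^{N} P(x_n)\, \Pr(y_n \mid x_n)^{1/(1+\rho)} \right]^{1+\rho}. \tag{13}$$

The term in brackets in Eq. 13 is the average of a product of independent random
variables, and is equal to the product of the averages. Thus

$$\bar{P}_e \le (M-1)^{\rho} \sum_{\mathbf{y}} \prod_{n=1}^{N} \left[ \sum_{k=1}^{K} P(x_k)\, \Pr(y_n \mid x_k)^{1/(1+\rho)} \right]^{1+\rho}, \tag{14}$$

where $x_1, \ldots, x_K$ are the letters in the channel-input alphabet. Applying an almost
identical argument to the sum on $\mathbf{y}$, we obtain

$$\bar{P}_e \le (M-1)^{\rho} \left\{ \sum_{j=1}^{J} \left[ \sum_{k=1}^{K} P(x_k)\, \Pr(y_j \mid x_k)^{1/(1+\rho)} \right]^{1+\rho} \right\}^{N}, \tag{15}$$

where $y_1, \ldots, y_J$ are the letters in the channel-output alphabet.

If the rate is defined in natural units as $R = (\ln M)/N$, Eq. 15 can be rewritten

$$\bar{P}_e \le e^{-N E(\rho)} \qquad \text{for any } \rho,\; 0 < \rho \le 1 \tag{16}$$

$$E(\rho) = E_0(\rho) - \rho R \tag{17}$$

$$E_0(\rho) = -\ln \sum_{j=1}^{J} \left[ \sum_{k=1}^{K} P(x_k)\, \Pr(y_j \mid x_k)^{1/(1+\rho)} \right]^{1+\rho}. \tag{18}$$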
It can be shown by straightforward but tedious differentiation that $E_0(\rho)$ has a positive
first derivative and negative second derivative with respect to $\rho$ for a fixed $P(x_k)$.
Optimizing over $\rho$, we get the following parametric form for $E(\rho)$ as a function of R:

$$E(\rho) = E_0(\rho) - \rho E_0'(\rho) \tag{19}$$

$$R(\rho) = E_0'(\rho) \qquad \text{for } E_0'(1) \le R \le E_0'(0) \tag{20}$$

$$E = E_0(1) - R \qquad \text{for } R \le E_0'(1). \tag{21}$$

From the properties of $E_0(\rho)$, it follows immediately that the E,R curve for a given
choice of $P(x_k)$ appears as shown in Fig. X-8. $E_0'(0)$ turns out to be the mutual
information on the channel for the input distribution $P(x_k)$. Choosing $P(x_k)$ to achieve channel
capacity, we see that for R < C, the probability of error can be made to decrease
exponentially with the block length, N. For rates other than channel capacity, $P(x_k)$ must
often be varied away from the capacity input distribution to achieve the best exponent.
Equations 19-21 are equivalent to those derived by Fano,¹ and for $E_0'(1) \le R \le E_0'(0)$,
a lower bound to $P_e$ can also be derived¹ of the form $P_e \ge K(N)\, e^{-N E(\rho)}$, where $E(\rho)$ is
the same as in Eq. 19, and K(N) varies as a small negative power of N.
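The exponent defined by Eqs. 16-18 can be evaluated directly for a small channel. The sketch below is illustrative only: the binary symmetric channel with crossover probability 0.1, the equiprobable input distribution, the grid search over $\rho$, and the names `e0` and `exponent` are all assumptions chosen for the example.

```python
import math


def e0(rho, P, W):
    """E_0(rho) of Eq. 18 in natural units.

    P[k]    : input letter probabilities P(x_k)
    W[k][j] : transition probabilities Pr(y_j | x_k)
    """
    total = 0.0
    for j in range(len(W[0])):
        inner = sum(P[k] * W[k][j] ** (1.0 / (1.0 + rho)) for k in range(len(P)))
        total += inner ** (1.0 + rho)
    return -math.log(total)


def exponent(R, P, W, grid=1000):
    """Largest value of E_0(rho) - rho * R over 0 <= rho <= 1, by grid search."""
    return max(e0(i / grid, P, W) - (i / grid) * R for i in range(grid + 1))


P = [0.5, 0.5]                        # equiprobable inputs (assumed example)
W = [[0.9, 0.1], [0.1, 0.9]]          # binary symmetric channel, crossover 0.1

print(e0(1.0, P, W))                               # equals -ln(0.8) for this channel
print((e0(1e-6, P, W) - e0(0.0, P, W)) / 1e-6)     # slope at rho = 0: mutual information in nats
for rate in (0.05, 0.1, 0.2, 0.3, 0.4):
    print(rate, exponent(rate, P, W))
```

The finite-difference slope at $\rho = 0$ reproduces the mutual information of the channel (about 0.368 nat here), and the exponent falls to zero once the rate reaches that value, which is the behavior described for the E,R curve.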

R. G. Gallager
A. COMPUTER FORMULATION AND SOLUTION OF ALGEBRA PROBLEMS GIVEN IN A RESTRICTED ENGLISH

A hand analysis of eighty algebra word problems has been made. Several preliminary
processing programs were written in COMIT and debugged. It was then found that
a more general list-processing language such as LISP would be much more convenient.
However, a combination of the two languages would be better than either alone.
Therefore, a COMIT interpreter (METEOR) was written in LISP, and debugged. It will
handle simple COMIT rules and program flow, with added provision for mnemonic
labelling of constituents. Any LISP function can be applied to a constituent, or set of
constituents, from the work space. The input-output features of COMIT have been
implemented, but the rules are entered in a LISP format. Certain other changes have
been made to COMIT format which seemed natural and facilitated programming.

A memorandum has been published by the Artificial Intelligence Group which
describes in detail the METEOR program and its use in the LISP system.

D.  G.  Bobrow 
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A.  ANALYSIS  OF  ERROR  SCORES  DERIVED  FROM  AN  AUTOMATIC  VOWEL 

REDUCTION  SCHEME 

In an evaluation of the performance of an automatic procedure for reducing vowel
spectra to descriptions in terms of the time-variant vocal-tract resonances and source
characteristics, the behavior of the squared-error score has been examined in some
detail. This examination provides some insight into the spectrum-matching accuracy
of the automatic program and, it is hoped, may lead also to better understanding of the
operational meaning of an adequate match in terms of error score.

The automatic program under investigation has already been described,1,2 and its
accuracy in matching spectra has been discussed3,4 in terms of differences in pole
specifications between vowel samples analyzed by the automatic procedures and by manual
procedures. The manual reduction methods are described in detail in a paper by
Bell et al.5 The speech materials are those reported by House,6 consisting of eight
vowels in 14 consonantal contexts spoken by three talkers.

Three contiguous spectral samples located medially in each of the vowels were analyzed
manually and automatically, and for each such analysis there corresponds a
squared-error score. The mean-error scores associated with each vowel and talker
are displayed in Table XII-1 according to spectrum-matching method. Inspection of
these scores indicates that the magnitude of the mean error associated with a given
vowel, for example, is reasonably stable from talker to talker, and that the average
error scores of the talkers are highly similar. There is, however, a statistically significant
difference between the magnitude of the mean-error scores of the two methods,
manual and automatic, with the latter providing lower scores. This tendency is quite
consistent when scores for individual vowels and talkers are studied. For both the manual
and automatic analyses, the vowel means display significant differences, with higher
scores tending to be associated with the back, rounded vowels. This tendency may be
aggravated when rounded vowels are in an environment provided by labial consonants.
In general, however, consonantal context does not provide a significant source of
error-score variation.

The mean-error scores per filter for the two methods of analysis are shown in
Table XII-2. This presentation emphasizes the consistency with which the automatic

*This  research  was  supported  in  part  by  the  U.  S.  Air  Force  (Electronic  Systems 
Division)  under  Contract  AF  19(604)-6102;  in  part  by  the  National  Science  Foundation 
(Grant  G-16526). 
Table XII-1. Mean-error scores (in db) for spectrum matches to various vowels,
arranged by methods and talkers. Scores are computed over filters 1-24,
corresponding to a frequency range of 100-3050 cps. Each entry is based
on three centrally located samples taken from vowels in 14 consonantal
contexts.

                                     Vowels
Method      Talker    i    ɪ    ɛ    æ    ɑ    ʌ    ʊ    u    Mean

Automatic   1        45   34   29   33   30   37   66   55     41
            2        65   42   41   36   46   43   65   56     49
            3        40   35   31   28   34   38   59   65     41
            Mean     50   37   33   32   37   39   63   59     44

Manual      1        49   48   48   48   40   47   80   62     53
            2        49   54   54   46   42   48   53   55     50
            3        44   44   47   39   46   46   59   66     49
            Mean     47   48   49   45   43   47   64   61     51

Table XII-2. Mean-error scores (in db) per filter for spectrum matches to various
vowels, arranged by methods. Scores are computed over filters 1-24,
corresponding to a frequency range of 100-3050 cps. Each entry is
based on three centrally located samples taken from vowels in 14 consonantal
contexts as spoken by three talkers.

                               Vowels
Method       i     ɪ     ɛ     æ     ɑ     ʌ     ʊ             u     Mean

Automatic    2.1   1.6   1.4   1.4   1.5   1.6   [illegible]   2.5   1.8
Manual       2.0   2.0   2.1   1.9   1.9   2.0   [illegible]   2.5   2.1
procedure provides lower scores. According to this table the fit of the vowel data is,
on the average, accurate to better than 2 db in each of 24 frequency bands.
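
As a cross-check on the two tables, the following sketch recomputes the talker and method means from the individual entries of Table XII-1 and converts the method means to per-filter scores. The division by the 24 filters is an assumption about how the per-filter scores were obtained; the report does not state the conversion explicitly. The names `talker_means`, `method_mean`, and `per_filter` are illustrative only.

```python
# Mean-error scores from Table XII-1, one row per talker (eight vowels each).
AUTOMATIC = [
    [45, 34, 29, 33, 30, 37, 66, 55],
    [65, 42, 41, 36, 46, 43, 65, 56],
    [40, 35, 31, 28, 34, 38, 59, 65],
]
MANUAL = [
    [49, 48, 48, 48, 40, 47, 80, 62],
    [49, 54, 54, 46, 42, 48, 53, 55],
    [44, 44, 47, 39, 46, 46, 59, 66],
]
N_FILTERS = 24  # filters 1-24, covering 100-3050 cps


def talker_means(rows):
    """Mean score of each talker, averaged over the eight vowels."""
    return [sum(row) / len(row) for row in rows]


def method_mean(rows):
    """Grand mean of one method over all talkers and vowels."""
    values = [v for row in rows for v in row]
    return sum(values) / len(values)


def per_filter(score, n_filters=N_FILTERS):
    """Assumed conversion: the score spread evenly over the filters."""
    return score / n_filters


if __name__ == "__main__":
    for name, rows in (("Automatic", AUTOMATIC), ("Manual", MANUAL)):
        grand = method_mean(rows)
        print(name, [round(m) for m in talker_means(rows)],
              round(grand), round(per_filter(grand), 1))
```

Under this assumption the recomputed method means (44 and 51) and per-filter means (1.8 and 2.1) agree with the tabulated values.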

A. S. House, A. P. Paul

A. TRANSLATING ORDINARY LANGUAGE INTO FUNCTIONAL LOGIC


Work continues on the COMIT program, which was described in Quarterly Progress
Report No. 68 (pages 174-175), for translating ordinary language into symbolic logic.
The last report described a program for translating sentences and arguments formulated
in ordinary English into the notation of propositional logic. Since that time, the
author has been working on the extension of that program to perform translations into
a "higher" type of logic, i.e., first-order functional calculus. The work that he has
done thus far on this subject may be described under three headings: (I) the formulation
of a logical grammar, which states the essential logical characteristics of some
types of statements in ordinary language which are most readily analysable into formulae
in functional calculus; (II) the writing of a COMIT program that employs this
grammar for random generation of sentences; and (III) the writing of a COMIT program
for recognition of the grammatical structures of the sentences of the relevant
type.

As for (I), the sentences that lie within the scope of our grammar are of two types.
In the first type of sentence, an 'NP' (the commonly accepted abbreviation of 'noun
phrase') is connected to another NP by a form of the verb 'to be'. In the second type,
an NP is linked to another NP by what we may rather sententiously call a "binary relational
construction," by which we mean a word or a phrase such as 'helps', 'belongs to',
or 'is greater than'. These words and phrases do not fit snugly into any one grammatical
category (for instance, they cannot all be described as transitive verbs), but all
do the job of stating a comparison or relation between two essentially different NPs.
These two sentential types are succinctly stated in two COMIT rules:

(1) S = NP + BE + NP;  (2) S = NP + P/.2 + NP

in which 'S' stands for 'sentence', 'BE' for the verb 'to be', 'P/.2' for any binary relational
construction, and 'NP', as stated above, for 'noun phrase'. Moreover, 'NP' may
be expanded in either of two ways, formulable in the COMIT rules:

(3) NP = CONST;  (4) NP = Q/X + PHI/X.

According to (3), 'NP' may be replaced by a proper name, such as 'George Washington'


*This work was supported in part by the National Science Foundation (Grant
G-24047).
or 'Chicago'. According to (4), 'NP' may be replaced by a function 'PHI' of X preceded
by a determiner or quantifier 'Q' for X. The 'Qs' of X are phrases like 'the X', 'some
X', 'exactly five X', and 'at least two X'. The 'PHIs' of X are expressible in the following
COMIT rules (a)-(g), in which the only symbol not previously explained is 'P/.1',
which stands for nontransitive (or "unary") predicate constructions such as 'is a horse',
'is a Professor of Greek', 'lives', or almost any verb or verbal construction that is not
followed by a direct object. (Many verbs, of course, are not definitely classifiable
either as transitive or intransitive, e.g., 'believes' and 'sees'. In such cases the recognition
program has to look at what immediately follows the verb before opting definitely
either for P/.1 or P/.2.)

(a) PHI/X = X + P/.1              (e) PHI/X = Y + P/.2 + X
(b) PHI/X = X + P/.2 + NP         (f) PHI/X = PHI/X + AND + PHI/X
(c) PHI/X = NP + P/.2 + X         (g) PHI/X = PHI/X + OR + PHI/X
(d) PHI/X = X + P/.2 + Y

In the formulae above, X and Y may be replaced by any letters from A to Z, subject
to certain conditions. Whenever 'NP' is expanded into 'Q/X + PHI/X', in accordance
with formula (4), the 'X' chosen is the next letter in alphabetical order; i.e., the first
substitution for 'X' is 'A'; the second, 'B'; and so on. (An exception to this rule is
made in sentences of type (1), in which one may begin expanding both 'NPs' with the same
letter 'A'.) The 'Y', in (d) and (e) above, is any letter that has been previously used.
The following examples (i)-(vii) are all 'NPs' according to our grammar.
They illustrate, respectively, rules (a)-(g) above for the expansion of 'PHI/X'. In each
case, the phrase 'such that', although logically redundant, is inserted between 'Q/X'
and 'PHI/X' to improve legibility. The generation program also employs this device.

(i)  The  A  such  that  A  is  a  horse. 

(ii) At least one D such that D admires George Washington.

(iii)  Some  B  such  that  Chicago  likes  B. 

(iv)  Exactly  three  C  such  that  C  notices  A. 

(v)  All  B  such  that  A  is  greater  than  B. 

(vi)  No  A  such  that  A  is  a  horse  and  A  is  Chicago. 

(vii)  At  most  five  C  such  that  C  is  a  Professor  of  Greek  or  C  is  a 
Professor  of  Latin. 

An example of a complete sentence which may be translated into the terminology above
is the following, taken from I. M. Copi's Symbolic Logic.1

(viii) The architect who designed Tappan Hall designs only
office buildings.

Stated  in  our  terminology,  it  becomes 

(ix)  The  A  such  that  A  is  an  architect  and  A  designs  Tappan 
Hall  designs  only  B  such  that  B  is  an  office  building. 

It may be noted that two different variables, 'A' and 'B', are employed, and that the
imperfect 'designed' is replaced by the present 'designs'. This replacement simplifies
the sentence by reducing what are, on the face of it, two transitive verbs to one;
and it does not affect the logic of the sentence. An example of a sentence that the
machine randomly generated using our grammar is

(x) SIWASH + LIKES + EXACTLY + ONE + X/A + SUCH + THAT + X/A + IS + A +
WOMAN + AND + SOME + X/B + SUCH + THAT + X/B + CAN + SWIM + FASTER +
THAN + ANY + X/C + SUCH + THAT + X/C + IS + JACK + OF + X/A + CAN +
COMMAND + X/A + AND + X/A + GETS + RUSSIA + .

The  last  example,  as  opposed  to  the  immediately  preceding  one,  is  stated  in  COMIT 
notation.  We  have  found  it  expedient  to  use  as  variables  'X/A',  'X/B',  'X/C',  etc., 
rather  than  'A',  'B',  'C',  etc.  In  addition  to  strictly  programmatic  advantages,  this 
device  avoids  any  confusion  between  the  article  'A'  and  the  letter  'A'. 
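
The generation step can be sketched in a few lines. The following is a minimal illustration in Python, not the COMIT generation program itself. The lexicon, the 50-50 choice between rules, the depth bound, and the decision to draw 'Y' only from letters other than the current 'X' are all assumptions made for the example; only rules (1)-(4), rules (a)-(g), and the alphabetical allocation of variables come from the text.

```python
import random
import string

# Hypothetical mini-lexicon; the report does not list the program's vocabulary.
CONSTS = ["SIWASH", "CHICAGO", "RUSSIA"]
QUANTS = [["THE"], ["SOME"], ["EXACTLY", "ONE"], ["AT", "LEAST", "TWO"]]
P1 = [["IS", "A", "HORSE"], ["IS", "A", "WOMAN"], ["LIVES"]]
P2 = [["LIKES"], ["HELPS"], ["BELONGS", "TO"], ["IS", "GREATER", "THAN"]]


class Generator:
    def __init__(self, rng, max_depth=3):
        self.rng = rng
        self.max_depth = max_depth  # illustrative bound so recursion terminates
        self.used = []              # variable letters allocated so far

    def sentence(self):
        # Rules (1) and (2): S = NP + BE + NP  or  S = NP + P/.2 + NP.
        link = ["IS"] if self.rng.random() < 0.5 else self.rng.choice(P2)
        return self.np(0) + link + self.np(0) + ["."]

    def np(self, depth):
        # Rule (3): NP = CONST.
        if depth >= self.max_depth or self.rng.random() < 0.5:
            return [self.rng.choice(CONSTS)]
        # Rule (4): NP = Q/X + PHI/X, with X the next unused letter.
        x = string.ascii_uppercase[len(self.used)]
        self.used.append(x)
        return (self.rng.choice(QUANTS) + ["X/" + x, "SUCH", "THAT"]
                + self.phi(x, depth + 1))

    def phi(self, x, depth):
        var = "X/" + x
        rules = ["a", "b", "c"]
        earlier = [y for y in self.used if y != x]
        if earlier:
            rules += ["d", "e"]  # these rules need a previously used letter Y
        if depth < self.max_depth:
            rules += ["f", "g"]  # conjunction and disjunction recurse
        rule = self.rng.choice(rules)
        if rule == "a":
            return [var] + self.rng.choice(P1)
        if rule == "b":
            return [var] + self.rng.choice(P2) + self.np(depth + 1)
        if rule == "c":
            return self.np(depth + 1) + self.rng.choice(P2) + [var]
        if rule == "d":
            return [var] + self.rng.choice(P2) + ["X/" + self.rng.choice(earlier)]
        if rule == "e":
            return ["X/" + self.rng.choice(earlier)] + self.rng.choice(P2) + [var]
        joiner = "AND" if rule == "f" else "OR"
        return self.phi(x, depth + 1) + [joiner] + self.phi(x, depth + 1)
```

Calling `Generator(random.Random(seed)).sentence()` returns a list of tokens that can be joined with ' + ' to give a string in the style of example (x). With the default depth bound at most a handful of variables are allocated; a much larger bound would need a guard against running past the letter 'Z'.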

Next,  the  recognition  program  takes  sentences  such  as  (ix)  and  (x)  and  "parses" 
them,  i.e.,  it  tags  each  grammatical  construction  with  its  appropriate  label.  It  thereby 
reveals  how  the  sentences  must  have  been,  or  could  have  been,  generated.  Using  the 
recognition  program,  the  computer  correctly  printed  out  the  analyses  of  (ix)  and  (x); 
these  are  included  below  as  (xi)  and  (xii),  respectively. 

(xi) S + NP + Q/A + THE + X/A + PHI/A + PHI/A + X/A + P/.1 +
IS + AN + ARCHITECT + AND/A + PHI/A + X/A + P/.2 +
DESIGNS + NP + TAPPAN/CONST + HALL/CONST + P/.2 +
DESIGNS + NP + Q/B + ONLY + X/B + PHI/B + X/B + P/.1 +
IS + AN + OFFICE + BUILDING + .

(xii) S + NP + SIWASH/CONST + P/.2 + LIKES + NP + Q/A + EXACTLY +
ONE + X/A + PHI/A + PHI/A + PHI/A + X/A + P/.1 + IS + A +
WOMAN + AND/A + PHI/A + NP + Q/B + SOME + X/B + PHI/B + X/B +
P/.2 + CAN + SWIM + FASTER + THAN + NP + Q/C + ANY + X/C +
PHI/C + X/C + P/.2 + IS + JACK + OF + X/A + P/.2 + CAN +
COMMAND + X/A + AND/A + PHI/A + X/A + P/.2 + GETS + NP +
RUSSIA/CONST + .


It may be noted that the recognition program erases the redundant phrase 'such that'
whenever it encounters it in the input sentences. Also, it adds some additional subscripts.
Proper names are subscripted with 'CONST' (for 'constant'), and conjunctions
('AND' and 'OR') are subscripted with the letter possessed by the two 'PHIs' which they
connect. To discover the scope of any 'S', 'NP', or 'PHI', one may employ the following
counting method: Starting immediately to the left of the symbol whose scope is to be
discovered, initialize the count at 1 (in COMIT, one would use a marker with the numerical
subscript of 1, e.g., 'M/.1'), and proceeding from left to right raise or lower the
count in accordance with the rules:

PHI, S = +2;  NP = +1;  P/.2, X, AND, OR, BE = -1;
P/.1, any consecutive string of items labelled 'CONST' = -2;
all other items = 0.

The scope of the term in question is determined as soon as the count 0 is reached.
More precisely, when the count 0 is reached, one continues counting items as long as
the count remains 0. When an item is encountered which causes the count to deviate
from 0, the counting stops, and the scope is taken to be everything except the last item.
In other words, if 'P/.1' causes the count to go to 0, the counting continues until it
includes everything labelled 'P/.1', such as 'IS + A + HORSE'.
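
The counting rules can be illustrated with a short sketch. It is written in Python rather than COMIT and is not the program the authors planned; it only follows the rules as stated. It assumes that each item is a string of the form 'LABEL' or 'LABEL/SUBSCRIPT', as in the parsed output (xi), and the function names `weights` and `scope` are hypothetical.

```python
def weights(tokens):
    """Amount by which each item raises or lowers the count."""
    out = []
    prev_const = False
    for tok in tokens:
        base, _, sub = tok.partition("/")
        is_const = sub == "CONST"
        if is_const:
            # A consecutive string of CONST items counts -2 once, as a whole.
            w = 0 if prev_const else -2
        elif tok == "P/.1":
            w = -2
        elif tok == "P/.2" or base in {"X", "AND", "OR", "BE"}:
            w = -1
        elif base in {"PHI", "S"}:
            w = 2
        elif base == "NP":
            w = 1
        else:
            w = 0
        out.append(w)
        prev_const = is_const
    return out


def scope(tokens, start):
    """Items in the scope of the S, NP, or PHI found at index `start`."""
    w = weights(tokens)
    count = 1  # the count is initialized at 1 just left of the symbol
    end = start
    reached_zero = False
    for j in range(start, len(tokens)):
        if reached_zero and w[j] != 0:
            break  # this item would move the count away from 0; exclude it
        count += w[j]
        end = j + 1
        if count == 0:
            reached_zero = True
    return tokens[start:end]


# The parsed sentence (xi), one item per token.
XI = ("S NP Q/A THE X/A PHI/A PHI/A X/A P/.1 IS AN ARCHITECT AND/A "
      "PHI/A X/A P/.2 DESIGNS NP TAPPAN/CONST HALL/CONST P/.2 DESIGNS "
      "NP Q/B ONLY X/B PHI/B X/B P/.1 IS AN OFFICE BUILDING .").split()
```

Applied to (xi), `scope(XI, 0)` covers the whole sentence, while `scope(XI, 17)` stops after the proper name, because the following 'P/.2' would move the count away from 0. One limitation of the sketch is that an ordinary word that happens to coincide with a label (for example 'BE' inside a predicate phrase) would be miscounted.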

The counting method just described for determining the scope of a term has not yet
been programmed, but it will be an essential part of the next program that we intend to
write, which will translate the parsed sentences, such as (xi) and (xii), into purely
logical notation. The next task following the completion of this will be to write a program
that will translate sentences stated in the ordinary English of (viii) into the quasi-logical
language of (ix) and (x). These two uncompleted programs, in conjunction with
those already written, will constitute a complete program for translating our particular
types of sentences from ordinary English into first-order functional calculus.

J. L. Darlington

B. FUNDAMENTAL SENTENCE-MEANING

In previous Quarterly Progress Reports1-3 I have pointed out that a general explanation
of the phenomenon of linguistic paraphrase cannot be achieved within the framework
of a theory of semantics that makes the tacit assumption that word-synonymy (or
morpheme-synonymy) alone can account for it. Such a view not only leads inexorably
to the absurd conclusion that a structural-constant, such as 'all', 'only', 'any', 'some',
'not', 'ever', must at times be considered as synonymous with one structural-constant
and at other times synonymous with a different structural-constant, but fails to provide
the technical means powerful enough to discover fundamental semantical relations
that exist among these structural morphemes, relations that can be formulated into linguistic
laws that are general in that they encompass vast areas of linguistic phenomena,
and are explanatory in that they provide reasons why one cannot always substitute the
same structural-constant for another in all sentences containing an occurrence of that
morpheme and obtain sentences that are synonymous, why and when there occurs a
change of meaning if one makes morpheme substitution of this kind, and so on. The
reason why a method of discovery based upon such a view of semantics cannot be powerful
enough to discover more fundamental laws is that it has been, by definition,
restricted to the use of only substitution rules for the greater part. That is to say, on
the basis of this assumption concerning semantics, the structure of the grammatical
strings (the very structure that is to be investigated) has been predetermined to be
essentially of the phrase-structure type, wholly accessible to constituent analysis, with
some rules of ordering of constituents thrown in when necessary. The type of grammar
has already been assumed to be a context-free grammar.

In the theory of semantics proposed by the author, it is held that the word-meaning
of these critical structural morphemes always remains constant (hence the name
structural-constant) and that the sentence-meaning can vary according to the particular
configuration of structural-constants which occurs in the sentence. Some configurations
give rise to sentences whose sentence-meanings are synonymous although there
is no one-to-one morpheme correspondence, and others give rise to sentences whose
sentence-meanings are different. In order to illuminate how the meaning of a semantical
structural-constant can remain constant and at the same time the sentence-meaning
can vary depending upon the structural context, let us construct the following analogy.

Let us liken the structural configuration of a sentence-type to a machine constructed
out of cogwheels, levers, rods, bolts, screws, and other such parts. The particular
set of parts and the manner in which they are connected to one another give rise to a
machine that has a certain output. The same parts, when combined in a different way,
give rise to a different machine. It may happen that particular choices of parts and
different combinations of them give rise to several machines that operate to produce
the same output, although they are not structurally isomorphic; these could be called
synonymous machines. In this case it could be said that the machines have the same
fundamental construction. In the case in which several machines are not structurally
isomorphic and do not operate to produce the same output, the machines are said to
have different fundamental constructions. A construction engineer knows which choice
of parts and which combinations of them give rise to the various kinds of machines. In
the case in which the engineer has built two or more machines that appear to be structurally
isomorphic but operate to produce different output, some of the machine parts
have become built-in units; the structures that differ are hidden under the cover of a
closed unit. Such machines could be said to be ambiguous.

It is obvious from the analogy that the author believes that only a context-sensitive
grammar is adequate to handle the structure of language. The analogy is so obvious
that one need not dwell on it. But I would like to discuss briefly the semantical
structural-constants 'all' and 'only'. It is only when one regards 'all' as a constant,
analogous to a functioning meaningful machinery part, that one can explain why 'all' can
range in meaning over 'everything', 'the totality', 'the whole', 'the end', etc. As an
operator whose function is to determine the extent of the upper boundaries enclosing
all members from the inside to the outer limits, it can range over many different types
of things; sometimes it operates on countable argument individuals, sometimes on
predicate functions such as verbs, adjectives, and nouns, sometimes on functions of
functions such as adverbs. Sometimes it operates upon the infinitely divisible parts
belonging to a whole or single individual. But it always has one job to do, a job that
never varies. The morpheme 'only' also functions to determine a boundary, but, unlike
'all', it does not specify that the total membership is involved; rather it restricts, or
clamps the lid down on the upper boundary from outside the set. 'All' and 'only' are
closely related operators in that they operate in the reverse of each other. Examples (1)
and (3) have been selected to illustrate the comparison. Since, however, there does
not exist a logical operator corresponding notationally to the grammatical form of 'only',
'only' is expressed symbolically in the predicate calculus through a paraphrase of the
sentence containing an occurrence of 'only', a paraphrase in which 'all' appears. The
English language often makes use of this paraphrase to express 'only'. Illustrations of
this usage are given in examples (7) and (9) and again in examples (20a) and (20b). In
order that people untrained in the logical symbolism of the predicate calculus can follow
the text more easily, I shall not use symbolic notation in this report. However, it must
be kept in mind that for each sentence in the examples there is a general schematization,
a symbolic formula in which the denotative terms of the sentence are expressed as variables,
only the range of the variable having structural significance; this formula represents
the sentence-type of the sentence in question.

In order to see the effect that 'only' can make on the fundamental sentence-meaning,
let us first examine an example of a rather simple sentence-type whose subject noun
phrase contains a universal quantifier,

(1)  All  women  are  frail 

whose fundamental sentence-meaning is

(2)  To  be  a  woman  is  a  sufficient  condition  of  being  frail. 

If  we  replace  'all'  by  'only'  in  sentence  (1),  we  obtain 

(3)  Only  women  are  frail 

whose fundamental sentence-meaning is the reverse of sentence (2), namely,

(4)  To  be  a  woman  is  a  necessary  condition  of  being  frail. 
The morpheme 'only' functions here to reverse the order of the asymmetrical operation
of implication. In other words, if 'all' is the major quantifier of a sentence S that can
be represented by the general formula in the predicate calculus '(x)[f(x) ⊃ g(x)]' (read: for
every x, if x is f then x is g), then if 'only' is the major quantifier of a sentence S'
containing all the same morphemes as S with the exception of 'all', the general
formula is written '(x)[g(x) ⊃ f(x)]' (read: for every x, if x is g then x is f). Inspection
of the symbolic forms of these two formulas reveals the reversal in the order of the
left-hand term before the symbol for implication and the right-hand term after the symbol.
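
The reversal can be checked mechanically over a finite domain. The sketch below is only an illustration under stated assumptions: the domain of small integers and the two predicates are invented for the example, and the function names `all_are` and `only_are` are hypothetical.

```python
def all_are(f, g, domain):
    """'All f are g': for every x, if x is f then x is g."""
    return all(g(x) for x in domain if f(x))


def only_are(f, g, domain):
    """'Only f are g': for every x, if x is g then x is f."""
    return all(f(x) for x in domain if g(x))


# Illustrative domain and predicates (assumptions, not from the report).
DOMAIN = range(1, 13)


def is_multiple_of_four(x):
    return x % 4 == 0


def is_even(x):
    return x % 2 == 0
```

Here 'all multiples of four are even' holds, whereas 'only multiples of four are even' fails because 6 is even. In general `only_are(f, g, domain)` gives the same result as `all_are(g, f, domain)`, which is the exchange of left-hand and right-hand terms described above.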

Given an example of a sentence-type like

(5)  John  needs  to  move  to  feel  pain 

which already expresses necessary condition, since its fundamental sentence-meaning
is

(6)  Moving  is  a  necessary  condition  that  John  must  satisfy  in 
order  to  feel  pain 

we  see  that  the  addition  of  'only'  after  'needs'  in  sentence  (5)  serves  again  to  reverse 
the  fundamental  sentence-meaning.  The  resulting  sentence 

(7)  John  needs  only  to  move  to  feel  pain 

expresses the fundamental sentence-meaning

(8)  Moving  is  the  only  condition  that  John  must  satisfy  in  order  to  have  pain. 

The reversal that the addition of 'only' accomplishes is seen in the fact that sentence (8)
implies that not only is moving a necessary condition of feeling pain but a sufficient
condition; that is, since moving is the only condition needed, it is the necessary and
sufficient condition of producing pain. Sentence (7) is synonymous with

(9)  All  John  needs  to  do  is  to  move  to  feel  pain 

since both express the same fundamental sentence-meaning. The fundamental sentence-meaning
could be expressed in a grammatical form notationally closer to sentence (9) by

(10)  All  conditions  that  John  must  satisfy  in  order  to  feel  pain 
are  the  same  as  to  move. 

Incidentally,  note  that  both  sentence  (7)  and  sentence  (9)  are  synonymous  with 

(11)  John  need  only  move  to  feel  pain 

and 

(12)  All  John  need  do  to  feel  pain  is  to  move 
in both of which 'need' is clearly a modal. Compare sentences (11) and (12) with the
sentence-type of sentence (5) in which there is not the option of using the modal form
or the verbal form of 'need'; the verbal form is obligatory.

Once one understands the role that the semantic and purely syntactic structural-constants
play in giving rise to sentence-meaning, there does not seem to be anything more mysterious
in the concept of sentence-synonymy than there is in two maps, very different in
outward shape, representing the same country through different projections. As long
as there exists a one-to-one mapping between the projections, the two maps are regarded
as fundamentally isomorphic. In the investigation of fundamental semantical relations of
a natural language system, we search for translation rules that transform one sentence
into another sentence that is synonymous with it. These translation rules are the mapping
rules of the linguist interested in a semantically grounded grammar.

Every translation rule is justified as a correct rule if and only if there corresponds
to it a metalanguage statement, empirically established as true, which states the fundamental
sentence-meaning that the synonymous sentences have in common. Basic translation
rules are, of course, not easy to discover, particularly if the problem is attacked
by a hit-or-miss, trial-and-error method. Since the domain of linguistic data is so
immense, a procedure without a logically directed program of attack can be quite fruitless.
Fortunately, one can utilize the knowledge gained from the advances made in the
understanding of the logical structure of artificial language systems such as the
predicate calculus to construct such a program, even though one knows from experience
that a natural language system is not as limited in scope or purpose as an artificial one.

If we regard the artificial logical calculi as analogs of natural language systems, we are
justified in assuming that there must exist a metalogic for any language system, that is,
a body of implicit basic rules governing the use of the conventional symbols in the language;
implicit derivation rules, formation rules, and axioms. If there did not exist such
a logical set of rules governing the grammatical forms of sentences, it would be difficult
to understand how a language could be mastered, since it would then not be a system but
a very large set of unconnected strings. When one masters a language one has learned
how to use these rules; but there is a difference between being able to use a rule and
being able to state the rule explicitly. Observation and intuitive knowledge of the language
in question can lead to the establishing of a set of explicit translation rules,
accepted as correct because many sentences have been tested empirically to confirm
that the structural-constants do, in fact, behave in the way stipulated. Thus when
one has stated some of the metalogical rules, one can use them to demonstrate, according
to the metalogic of that language, that certain sentences are synonymous. That is, one
can validate the intuitive recognition of synonymy of several sentences by logical proof,
a logical proof that utilizes only a partial knowledge of the meanings of some important
structural-constants in that language, the meanings of the denotative constants, the
fundamental meanings of a few well-chosen sets of synonymous sentences regarded as
axioms, and the logical rules of deductive inference. If the synonymy of sentences can
be validated, one is confirmed in believing that the rules already formulated are actually
basic to that language system. Furthermore, new structural-constants and new rules
can be discovered in terms of the old. The following examples have been selected to
illustrate how English sentences, found intuitively to be synonymous, can be proved to
be synonymous by a tentatively accepted metalogic of English.

The concepts of upper and lower bounds play an important role in the English language.
Since the quantifier 'all' is essential in symbolic formulas expressing ideas
concerning limits, let us compare some sentences containing different grammatical
occurrences of 'all' with their respective intuitively synonymous sentences.

(13a)  I  helped  him  all  I  could 
(13b)  I  helped  him  as  much  as  I  could 

(14a)  St.  Paul  is  the  farthest  I  will  go 
(14b)  I  will  go  only  as  far  as  St.  Paul 
(14c)  St.  Paul  is  all  the  farther  I  will  go 

(15a) What big eyes you have! All the better to see you, my dear
(15b) What big eyes you have! So much the better to see you, my dear

(16a)  He  was  all  but  smiling 

(16b)  He  was  almost  (nearly)  smiling 

(17a)  He  was  anything  but  smiling 
(17b)  He  was  far  from  smiling 

(18a)  All  I  could  do  was  to  smile 
(18b)  I  could  only  smile 

(19a)  It  was  all  I  could  do  to  smile 
(19b)  I  could  hardly  smile 

(20a)  I  was  all  afire 
(20b)  I  was  completely  afire 

(21a)  All  of  the  class  was  excited 
(21b)  The  whole  class  was  excited 

According to grammarians such as Poutsma, 'all' in sentence (13a) is to be equated
with 'as much as' in sentence (13b), since these words can be interchanged with the
meaning of the original sentence preserved. According to the principle of
sentence-synonymy, the two sentences as wholes have the same fundamental sentence-meaning
but no synonymy holds between 'all' and 'as much as'. In sentence (13a), the universal
quantifier ranges over the set of possible degrees of help that x_1 was able to give; thus
x_1 helped x_2 with every degree of help that it was possible for x_1 to give x_2. If x_1
helped with all possible degrees, x_1 helped with the highest degree as well. That is to
say, x_1 helped up to the limit of his ability. Another sentence that is synonymous with
sentences (13a) and (13b), which corresponds to this last statement that was derived by
logical inference, is 'I helped him the most I could'. Sentence (13b), on the other hand,
states that the amount of help given to x_2 was 'equal to' or 'as much as' the amount x_1
was able to give. Since the amount that it was possible for x_1 to give is, in fact, the
greatest amount, more being impossible, the amount of help given equals the largest
or the upper limit. Ergo, the fundamental sentence-meaning for all three statements
has been proved to be identical. Their sentence-types, however, are different. It is
to be noted that the verb 'help', conceived as having different intensities ordered as to
degree, is a necessary assumption to the explanation of the sentence-synonymy. The
English language makes much use of the notion of ordering, as we shall see in
examples (14)-(19).
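
The argument for set (13) can be restated as a short symbolic sketch. The set D, the degree h, and the ordering on D are notation introduced here for illustration only; they are not part of the original analysis, and D is assumed to have a greatest element.

```latex
% D: assumed set of degrees of help that x_1 could give x_2, ordered by \le
% h: assumed degree of help actually given
\begin{align*}
\text{(13a)}\quad & h \in D \ \text{ and }\ \forall d \in D:\ d \le h
   && \text{(every possible degree is reached)} \\
\text{(13b)}\quad & h = \max D
   && \text{(the help given equals the most possible)}
\end{align*}
```

Under these assumptions the two conditions pick out the same h, because max D is by definition the element of D that is at least as great as every element of D.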

The sentence-meaning of sentence (14a) is a very direct expression of the fundamental
sentence-meaning of all three examples in set (14), which is

(22) St. Paul is the upper limit of the distance from an initial point
that x_1 is willing to go

because the sentence itself identifies by means of the word 'is' the place, St. Paul, with
'the point that is farthest', the use of the superlative guaranteeing the uniqueness of the
point that is farther than all other points x_1 is willing to go to from a given starting
point. The sentence-meaning of (14b) states that x_1 is willing to go (from some given
point) as far as St. Paul but no farther; thus sentence (14b) states that x_1 is willing to
go from an initial point to a point that is 'equal to' or 'as far as' the upper bound,
St. Paul, and does not exceed that point. Whether St. Paul is said to be the limit itself,
or the distance that x_1 is willing to go is declared to be as far as but not farther than
the limit, the distance in both cases is the same; hence the two sentences are synonymous.
The grammatical differences and similarities between the two sentences are
easily seen when both are put into their schematic formulations.

The proof of the synonymy of sentence (14c) to (14a) and (14b) is more complicated.
The phrase 'all the farther' refers to the total increase in distance. The complete
meaning of 'x is farther than y' has to be expressed 'x is farther from an initial point
y_0 than y is by a specific but unknown amount a', i.e., x increases in distance away
from y_0 more than y increases in distance by a definite increase. It should be noted
that in any ordering relation applied to things, whether the ordering be in terms of
intensity, degree, or size, there is always an amount involved. Sentence (14c) identifies
two classes by means of the word 'is', which stands between the left-hand phrase
that defines the set of points, '(from y_0 to) St. Paul', and the right-hand phrase that
defines the well-ordered set of points, 'all the increases in distance x_1 is willing to
go'. The right-hand phrase can be represented as the union of the collection of sets
containing as members all of the a's, which are obtained by taking in their ordered
turn all those points that are distant from y_0 to which x_1 is willing to go. Thus each
new point that x_1 is willing to go to starting from y_0 farther than the previous point
becomes an upper bound in its turn so that the smallest that each increase can be is
simply from one point to the next farther point. The sum of all of these increases (or
all farther points) is, of course, equal to the distance from y_0 to St. Paul because the
sentence has equated these two sets. The sets are identical because each set has
exactly the same members. The sentence-type of (14c) is quite different from those
of (14a) and (14b). Both (14a) and (14b) correspond notationally to the standard symbolic
definition of an upper bound; sentence (14c), by declaring the two sets to be identical,
lays down a condition under which there must exist an upper bound to the set of
points to which x_1 is willing to go.
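
The step from 'all the increases in distance' to the total distance can be illustrated with a telescoping sum. This is only a sketch under simplifying assumptions that the original does not state: the points x_1 is willing to go to are taken to form a finite chain p_0 = y_0, p_1, ..., p_n = St. Paul ordered by distance from y_0, and d is a hypothetical distance function.

```latex
% p_0 = y_0, p_1, ..., p_n = St. Paul: assumed finite chain of points, ordered by distance from y_0
% d(y_0, p): assumed distance function;  a_i: the i-th increase in distance
\begin{align*}
a_i &= d(y_0, p_i) - d(y_0, p_{i-1}), \qquad i = 1, \dots, n, \\
\sum_{i=1}^{n} a_i &= d(y_0, p_n) - d(y_0, p_0) = d(y_0, \text{St. Paul}),
\end{align*}
```

The last equality uses d(y_0, p_0) = 0. On this reading the sum of the increases reaches the distance to St. Paul exactly when St. Paul is the farthest point of the chain, which is the upper-bound condition discussed for (14a) and (14b).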

In  (15a)  we  have  the  familiar  lines  in  the  nursery  tale  of  Red  Riding  Hood  and  the 
wolf.  There  is  again  ordering  according  to  size;  but  in  this  example  two  properties 
are  being  ordered  and  the  ordering  is  correlated,  since  the  value  of  the  increase  or 
decrease  in  one  variable  depends  upon  the  determination  of  the  increase  or  decrease 
in  the  other.  The  wolf's  assumption  is 

(23)  The  bigger  the  eye,  the  better  the  sight  of  the  eye. 

The fundamental sentence-meaning of (23) is that the goodness of the sight of the
eye increases as a function of the greatness of size of the eye. In sentence (23), the
independent variable is given in the first phrase, the dependent in the second, so that
the ordering of the two phrases is an indispensable feature of the sentence-meaning.
If the order is reversed, grammatical changes occur; the synonymous sentence would
then be

(24)  The  sight  of  the  eye  is  better,  the  bigger  the  eye. 

When  Red  Riding  Hood  asserts  that  Grandma's  eyes  are  large,  she  means  that  as 
eyes  they  have  the  property  of  largeness,  since  size  is  never  an  absolute  property  but 
is  relative  to  the  kind  of  object  talked  about.  Hence  the  word  'large'  must  be  defined 
'large  with  respect  to  some  standard'.  Grandma's  eyes  being  a  specific  amount, 
although  an  unknown  amount,  larger  than  average,  the  ability  of  her  sight  is  directly 
correlated  to  that  amount  and  can,  in  theory,  be  calculated.  The  phrase  'all  the  better 
to  see  you'  in  sentence  (15a)  is  similar  to  the  phrase  'all  the  farther'  in  the  previous 
example,  since  it  too  denotes  the  set  of  'all  the  increases  in  goodness  of  seeing'.  The 
sum  of  all  these  increases  equals  the  amount  that  her  eyes  are  larger  than  average. 
Hence sentence (15b) that asserts 'that much (or so much) better to see you' is synonymous
with sentence (15a) because the union of the set of all the increases equals that
definite amount, the difference in size of eye; hence 'so much the better' can replace
'all the better'.

Examples (16)-(19) illustrate that the English language possesses techniques for
handling the concept of proximity to total absence or total presence of a property. Does
a person either smile or not smile? Our language is able to construct statements
expressing how close to the boundary of yes or no a fact may be. In sentence (16a) x_1
is not smiling but very near to the upper bound of the separation between not smiling
and smiling. The definition of the complex quantifier 'almost' involves the occurrence
of the universal quantifier 'all' in combination with the structural-constant 'except' or
'but' to exclude a limited finite number or amount from the total set, an amount that
approaches but never reaches zero as the limit. Hence, the words 'all but' correspond
directly to the definition of 'almost'.

Sentence (17a) states not only that x_1 was not smiling but also that every property that
could be included as part of smiling was absent; hence x_1 is far from the upper bound of
the separation between smiling and not smiling.

The synonymy of sentences (18a) and (18b) was treated in detail in Quarterly Progress
Report No. 66 (pages 289-293). I included them in this report to complete the list of
different grammatical occurrences of 'all' and to bring into sharp focus the critical effect
resulting from additional words such as 'It was' placed in front of sentence (18a), as in
example (19a). Sentence (19a) expresses the sentence-meaning that although x_1 smiled,
it was necessary for him to use the limit of his ability to do so. Thus x_1 was close to
the lower limit of smiling, that is, close to not smiling.

The synonymy of sentences (20a) and (20b) is demonstrated by showing that the
universal-quantifier 'all' ranges over all of the parts of an individual. Since 'all' is a
unity, it includes all of its parts and is therefore complete.

The explanation of the synonymy of sentences (21a) and (21b) is similar to the explanation
of set (20). It should be pointed out that sentences (20a) and (20b) are synonymous
with 'I was wholly afire'. In set (21), the universal-quantifier 'all' again operates upon
the parts of an individual. In the case of sentence (21a) the use of the definite article
before 'class', a collective noun, shows that 'class' is regarded as singular. 'All of
the class' and 'the whole class' refer to the class as a complete unity, that is, the
individual composed of all of its parts.

It has perhaps been remarked that the phrase 'fundamental sentence-meaning' has
been used frequently. What is the exact meaning of this phrase and just what is the
relation between fundamental sentence-meaning and ordinary sentence-meaning? In
this report I would like to discuss and clarify this elusive concept and try to explain
its role in relating sentences as synonymous.

In Quarterly Progress Report No. 68 (pages 176-180) I stated that the fundamental
sentence-meaning was logical in form, not grammatical; it was the forms of the sentences
found to be synonymous by empirical observation which were grammatical. On
the basis of my present point of view, I would not now deny grammatical form to the
sentence stating the fundamental sentence-meaning of a set of sentences that are alike
in expressing that sentence-meaning. Rather, I would describe the relationship of this
sentence to the others as follows. While it is true that fundamental sentence-meaning
is a concept belonging properly to the field of logic, this fact means that the sentence
stating the fundamental meaning of a set of synonymous sentences belongs properly to
the metalanguage of our language system; the synonymous sentences referred to by the
metalinguistic statement belong to the object language of English. The metalinguistic
statement states explicitly, by means of denotative terms, the specific kind of relation
that exists between the events or things designated by specific types of object-language
sentences. That is, the metalinguistic sentence, by stating which basic relation the
object-language sentences express, is using the name of this particular relation
denotatively; thus, for example, the phrase 'necessary condition' is itself not a
structural-constant but a denotative term, defined lexically in the same manner as other nouns,
and behaving grammatically as one. Like all other sentences, the metalanguage sentence
has a grammatical structure that can be described, and a sentence-meaning
of its own. In fact, it is possible for the metalanguage statement itself to be synonymous
with other metalanguage statements if the denotative constants are the same, as
we saw in examples (8) and (10). On the other hand, the object-language sentences that
the metalanguage sentences refer to as being synonymous do not use the denotative term
to refer reflexively to themselves; they express the particular relationship that they
have in common by using the proper structural-constants in specific configurations.
They are sentences in the object language because they use structural-constants that
can be defined only in the metalanguage. Let us inspect two synonymous sentences in
the object language that are grammatically similar in that all of their denotative terms
are the same, such as

(25a)  If  John  is  to  be  president,  he  must  get  his  organization  ready  now 

(25b)  Only  if  John  gets  his  organization  ready  now  can  he  be  president 

and  compare  their  grammatical  forms  with  the  grammatical  form  of  the  metalanguage
sentence  (26)  stating  their  common  fundamental  sentence-meaning

(26)  The  fundamental  sentence-meaning  of  'If  John  is  to  be  president,
he  must  get  his  organization  ready  now'  and  'Only  if  John  gets  his 
organization  ready  now  can  he  be  president'  is  that  John's  getting 
his  organization  ready  now  is  a  necessary  condition  for  John's 
being  president. 
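
As a rough check on (26), the two object-language sentences can be compared under a purely truth-functional reading. This is a simplifying assumption made here for illustration; the metalogic of English discussed in this report is not claimed to be truth-functional, and the function names below are hypothetical.

```python
from itertools import product


def implies(p: bool, q: bool) -> bool:
    """Material implication: false only when p is true and q is false."""
    return (not p) or q


def form_25a(president: bool, ready: bool) -> bool:
    # 'If John is to be president, he must get his organization ready now'
    # read as: president -> ready
    return implies(president, ready)


def form_25b(president: bool, ready: bool) -> bool:
    # 'Only if John gets his organization ready now can he be president'
    # read as the contrapositive: not ready -> not president
    return implies(not ready, not president)


def same_truth_table(f, g) -> bool:
    """True when f and g agree on every assignment of truth values."""
    return all(f(p, r) == g(p, r) for p, r in product([False, True], repeat=2))


print(same_truth_table(form_25a, form_25b))
```

Both readings exclude exactly one case, John being president without his organization having been made ready, which is what 'necessary condition' states denotatively in (26).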

Sentences (25a) and (25b) do not explicitly state that one event is a necessary condition
of the other; rather each expresses this fact through its grammatical form. On the
other hand, sentence (26), although it too expresses through its grammatical form a
fundamental sentence-meaning that could be made explicit by a meta-metalanguage
statement, states explicitly which fundamental sentence-meaning examples (25a) and (25b)
have in common, and the fundamental sentence-meaning of metalanguage sentence (26)
is not the fundamental sentence-meaning referred to by its subject noun phrase. However,
if we look at only the clause 'John's getting his organization ready now is a necessary
condition for John to be president', we see that this part of sentence (26) has the
grammatical structure of a complete sentence and it is synonymous with sentences (25a)
and (25b) even though its denotative terms are not everywhere the same as those of
(25a) and (25b). Indeed, if this sentence were not synonymous, it could not have been
identified with the fundamental sentence-meaning of these object-language sentences. It
appears, then, that one sentence out of a set of synonymous sentences (a wider set that
includes synonymous sentences whose denotative terms are not everywhere the same)
has been selected as the sentence whose ordinary sentence-meaning is regarded as
expressing the fundamental sentence-meaning of the set. That the selection of this one
sentence is not arbitrary is seen by the fact that it alone (or synonymous sentences
whose sentence-type may differ but whose denotative terms are everywhere the same),
because of the type of grammatical structure that it has, can appear intact in a
metalanguage statement of the sentence-type represented by sentence (26) in which the
subject, which refers denotatively to the concept 'the fundamental sentence-meaning',
specifically identifies this sentence with it. It is obvious that neither of the
object-language sentences (25a) or (25b) could serve grammatically in this capacity. It is by
this grammatical criterion that the sentence-type of a sentence expressing the fundamental
sentence-meaning can be objectively established as of a higher structural level
than the ordinary object-language sentences.

It should be remarked that the infinite regress that one is forced into by admission
of a hierarchy of levels of language is not so ominous as it appears, since the grammatical
structure of all levels has been assumed, on the basis of experience with levels
already constructed, to be similar.

The  fundamental  sentence-meaning,  I  suspect,  is  what  has  often  been  called  the
'proposition'  that  sentences  signify  or  designate.  There  has  been  much  philosophical 
discussion  concerning  the  nature  and  ontological  status  of  the  so-called  proposition.  I 
would  not  like  to  regard  the  proposition  as  a  logical  entity  that  is  incapable  of  having 
grammatical  form.  Such  a  view  would  make  a  mystery  of  what  the  proposition  is;  it 
seems  dubious  that  it  could  be  logical  in  character  if  it  cannot  be  expressed  in  language. 

Once it is recognized that sentences do have a sentence-meaning that is quite distinct
from word-meaning, that there is synonymy of sentence, as well as synonymy of word,
we no longer need the 'abstract proposition' to relate sentences having the same meaning
to one another; the concept of fundamental sentence-meaning, which states what the
synonymous sentences express, suffices. (Incidentally, if it is possible that a sentence
expresses a sentence-meaning that is not synonymous with that of any other sentence
of the same level, its fundamental sentence-meaning must still be stated in the
metalanguage of that language, since the metalanguage statement would utilize denotative
terms to refer explicitly to the meaning that the structural configuration of that sentence
expresses.)

The  concept  of  'proposition'  referred  not  only  to  the  entity  that  synonymous  sentences 
within  the  same  language  system  designated;  it  stood  also  for  the  entity  that  related 
sentences  having  the  same  meaning  but  belonging  to  different  language  systems  because 
it  was  felt  that  there  must  be  some  absolute  underlying  logical  construction  that  these 
sentences  designated  which  made  possible  translation  from  one  language  to  another. 
Since  the  proposition  could  not  be  expressed  in  any  of  these  concrete  languages  because 
then  it  would  be  a  sentence  of  that  language,  it  was  deemed  to  be  not  expressible  in  any 
natural  language;  it  stood  between  any  two  languages.  It  was  supposed  that  the  predicate 
calculus  itself  was  the  underlying  logical  structure  of  all  natural  language  systems,  that 
the  predicate  calculus  played  a  role  similar  to  the  mathematical  symbolism  used  universally
by  mathematicians,  each  of  whom  could  use  the  symbolism  within  his  own  native
language.  The  inadequacy  of  the  predicate  calculus  to  fill  this  role  has  been  amply 
demonstrated. 

According to this theory of the semantics of sentence-meaning, the concept of fundamental
sentence-meaning is instrumental in character, not absolute, since it is fundamental
with respect to some language system, determined implicitly by the formation
rules, derivation rules, and axioms of its metalanguage. The applicability of any logical
system depends upon how well it serves its recognized purpose, whether it be a calculus
constructed by known men or a naturally developed language constructed gradually by
many unknown men; therefore it is not surprising that the basic foundations of the highly
developed language systems that these philosophers and mathematicians had command
of contained concepts mutually recognized as important.

It should be noted that the basic concepts referred to by the fundamental
sentence-meanings are man's constructs, not things of the world; they are man's way of looking
at the world, organizing its material in ways useful to him. The fundamental
sentence-meanings refer to such concepts as necessary and sufficient conditions, causality,
generalization, unexpected occurrences, and possibilities. Since these are man's
concepts, a pertinent question that one should raise about the possibility of writing
contrastive grammars for languages in general and about the possibility of mechanical
translation from any language to any language, in particular, is: Are the fundamental
sentence-meanings belonging to each of the natural languages that are expressible by
the manifold linguistic devices displayed by each of these widely differing grammars,
sufficiently universal to serve as the mapping functions with which to coordinate the
sentence-types of any language system with those of any other language system? Could
there be such wide agreement about what concepts are fundamental even though the
cultural environments of man are so diverse? I think one can make an educated guess
based on empirical evidence that it is quite likely that there would be considerable
overlap of the fundamental sentence-meanings belonging to different language systems.
During the natural weeding-out process that must have taken place during the development
of a language, it must have been found out from experience that knowledge of some
concepts served man's purposes better than others; hence the general adoption of
grammatical techniques to express these important ideas was not just arbitrary; they were
selected in terms of their being the best instruments to reach a goal. The goals
determining the selection processes are the primary needs and purposes of man. And his
most fundamental need is to get along in his environment. Language then has to be an
instrument that is capable of communicating knowledge that members of a language
community need to express and wish to express to one another. To help one another to
adjust to nature, to control it to some extent, to make plans, men need to predict it, to
have some sense of order, to construct laws about it, and so forth. Concepts like
necessary and sufficient conditions, concepts necessary for the possibility of making
inferences, concepts like negation and affirmation that permit the possibility of distinguishing
between truth and falsehood would all appear universally needed by a society that is
intelligent enough to create and use such an abstract and complicated system of
conventional signs as a language. Also, as our biological knowledge of the organization of the
human brain increases, it may at some future time be shown that, in the evolutionary
process of adaptation, the human brain became structured in such a way that it
organizes the raw material of the senses in specific ways.

An interesting project would be a comparison of the sophistication of the concepts
involved in fundamental sentence-meaning (all languages possess fundamental
sentence-meanings, if not the same ones) between the language of a highly civilized society and
the language of what is considered by anthropologists to be a very primitive society.

Elinor  K.  Charney 

A.  THE  NATURE  OF  A  SEMANTIC  THEORY

As an investigation in semantic meta-theory,1 this report describes the abstract
form of a semantic theory (or description) of a natural language: the form of the
dictionary entries, the rules that project the lexical meanings expressed in dictionary
entries onto sentences in the form of sentential meanings, the relation of such rules
and dictionary entries, both to each other and to the grammar of the language, and the
notion 'sentence meaning'. A meta-theory for semantic theories is needed to inform
the field linguist of the types of semantic facts for which to look, the most revealing
and succinct way to arrange them, and that which can be said about the semantic
structure of the language based upon such facts in such an arrangement.2

In our paper "The Structure of a Semantic Theory" (henceforth we shall refer
to this publication as "SST"), Fodor and I show that a semantic theory of a natural
language has as its fundamental aim the construction of a system of rules which
represents that which a fluent speaker knows about the semantic structure of his
language which permits him to understand its sentences.3 The idea behind this conception
of a semantic theory is that such knowledge takes the form of recursive rules
that enable the speaker to compose, albeit implicitly, the meaning of any sentence out of
the familiar meanings of its familiar elementary components. This idea has the following
two-part rationale. First, the most salient and impressive fact about linguistic competence
is that a fluent speaker can understand a sentence of his language even though
he has never previously encountered it.4 In principle, he can understand any of the
infinitely many sentences of his language. But, since at any time in his life the speaker
can have encountered only an exceedingly small, finite subset of the infinite set of sentences
of his language, and, moreover, his storage capacity is finite, we can conclude
that the speaker's knowledge of the semantic structure of his language takes the form
of a finite set of recursive rules that fix a meaning for each of the infinitely many sentences
of his language. Second, since a speaker's understanding of the sentences of
his language also depends on his knowing sufficiently the meanings of their elementary
components, the lexical items in the vocabulary of the language, we can conclude that
the meaning that the rules fix for a sentence must be a compositional function of the
antecedently known meanings of its elementary components, the lexical items appearing
in it. Hence, a semantic theory of a natural language must contain rules that explicate
the compositional function that determines how a speaker utilizes the meanings of the
lexical items in a sentence to understand what that sentence means. If a semantic theory
is not adequate to explicate this function, it cannot represent the speaker's knowledge
of the semantic structure of his language.

*This work was supported in part by the National Science Foundation (Grant G-16526);
in part by the National Institutes of Health (Grant MH-04737-02); and in part by the
U. S. Air Force (Electronics Systems Division) under Contract AF19(628)-2487.

In  SST,  we  proposed  that  a  semantic  theory  consist  of  two  components.  First,  a 
dictionary  that  provides  an  entry  for  each  lexical  item  of  the  language  which,  in  some 
sense,  gives  the  meaning  of  the  entry.  Second,  a  finite  set  of  what  we  called  "projection 
rules"  that  use  lexical  information  supplied  by  the  dictionary  entries  for  the  lexical 
items  in  a  sentence  and  information  about  the  sentence's  syntactic  structure  supplied  by 
its  grammatical  description  in  order  to  assign  a  semantic  interpretation  to  the  sentence. 

Since information about a sentence's syntactic structure will be needed to assign a
semantic interpretation to it, we found it convenient to let the output of a grammar be
the input to a semantic theory. In this way, each sentence considered by a semantic theory
is represented as a concatenation of morphemes whose constituent structure is given
in the form of a hierarchical categorization of the syntactical parts of the concatenation.5
The sentence 'The boys like candy' is represented by the concatenation of morphemes
the+boy+s+like+candy which is hierarchically categorized as follows: the whole string
is categorized as a sentence at the highest level of the hierarchy; the+boy+s is categorized
as a noun phrase, and like+candy is categorized as a verb phrase at the next level
of the hierarchy; 'the' is categorized as an article; boy+s is categorized as a noun; 'like',
as a verb; and 'candy', as a noun; and so forth on the next and lower levels of the
hierarchy. Following Chomsky, we can represent such a categorization in the form of a
labelled tree diagram in which the notion 'the sequence of morphemes m belongs to the
category c' is formalized by the notion 'm is traceable back to a node labelled c'.6
Calling this element of the structural description that the grammar assigns to a sentence
its "constituent structure characterization," we have the constituent structure
characterization of 'The boys like candy' given in Fig. XIV-1.7 Therefore the input to a
semantic theory is sentences thus embedded within their constituent structure
characterizations, together with any further grammatical information that an optimal grammar
will supply about them.8

Fig. XIV-1. Constituent structure characterization of 'The boys like candy'.
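
The notion 'm is traceable back to a node labelled c' can be made concrete with a small sketch. The nested-tuple encoding, the category labels, and the helper names below are assumptions introduced for illustration; the tree is rebuilt only from the verbal description above and may differ in detail from the figure.

```python
from typing import List, Tuple, Union

# A tree is either a morpheme (str) or a tuple (label, child, child, ...).
Tree = Union[str, Tuple]

# Assumed reconstruction of the categorization described in the text.
BOYS_LIKE_CANDY: Tree = (
    "Sentence",
    ("NP", ("Article", "the"), ("Noun", "boy", "s")),
    ("VP", ("Verb", "like"), ("Noun", "candy")),
)


def leaves(tree: Tree) -> List[str]:
    """Return the morphemes dominated by a node, in left-to-right order."""
    if isinstance(tree, str):
        return [tree]
    out: List[str] = []
    for child in tree[1:]:
        out.extend(leaves(child))
    return out


def categories_of(tree: Tree, morphemes: List[str]) -> List[str]:
    """Labels of every node that dominates exactly the sequence `morphemes`."""
    if isinstance(tree, str):
        return []
    found = [tree[0]] if leaves(tree) == morphemes else []
    for child in tree[1:]:
        found.extend(categories_of(child, morphemes))
    return found


print(categories_of(BOYS_LIKE_CANDY, ["the", "boy", "s"]))
print(categories_of(BOYS_LIKE_CANDY, ["boy", "s", "like"]))
```

In this encoding the+boy+s traces back to the node labelled NP, while boy+s+like traces back to no single node and so belongs to no category, which matches the intent of the formalization.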

A  semantic  theory  takes,  one  after  another,  the  discrete  outputs  of  the  grammar 
and  operates  on  them  in  a  way  that  matches  what  a  fluent  speaker  does  to  obtain  his 
understanding  of  the  sentences.  This  way  of  conceiving  of  the  operation  of  a  semantic 
theory  provides  a  basis  for  explaining  how  a  fluent  speaker  applies  his  stock  of  lexical 
information to apprehend the meaning of each new syntactically well-formed arrangement
of lexical items that he encounters. Clearly, a semantic theory can provide such
an explanation only if its projection rules assign each sentence a semantic interpretation
that represents the manner in which a fluent speaker employs the syntactic structure
of the sentence to determine its meaning as a function of the meanings of the sentence's
lexical  items.  Hence,  we  must  consider  just  what  a  semantic  interpretation  ought  to 
tell  us  about  a  sentence. 

The  semantic  interpretations  produced  by  a  semantic  theory  for  the  sentences  of  a 
language  constitute  the  theory's  description  of  the  language's  semantic  structure.  Since 
a  fluent  speaker's  knowledge  of  the  semantics  of  his  language  manifests  itself  in  his 
verbal  performance,  a  semantic  interpretation  of  a  sentence  ought  to  tell  us  whatever 
the  speaker  implicitly  knows  about  the  sentence's  semantic  structure  that  enables  him 
to  carry  on  his  verbal  performances.  Thus,  the  fundamental  question  we  asked  in  SST 
about  the  speaker's  verbal  performance  was  "What  does  the  speaker  do  that  manifests 
his knowledge of the semantic structure of his language?" We answered that he differentiates
sentences that are semantically acceptable from those that are semantically
anomalous, he recognizes ambiguities stemming from semantic relations in a sentence,
he detects semantic relations between expressions and sentences of different syntactic
type and morphemic constitution, and so forth. On the basis of this answer, we concluded
that the semantic interpretations produced by a semantic theory must mark as
semantically  acceptable  and  anomalous  those  sentences  that  the  speaker  differentiates 
as  acceptable  and  anomalous,  mark  as  semantically  ambiguous  those  sentences  that  the 
speaker  regards  as  such,  mark  as  semantically  related  in  such-and-such  a  fashion  just 
those  n-tuples  of  expressions  and  just  those  n-tuples  of  sentences  that  the  speaker 
detects as so related, and so forth. Otherwise, we argued, the semantic theory cannot
claim to represent the speaker's semantic knowledge. For example, a semantic theory
of English would have to produce: a semantic interpretation for The bank is the scene of
the crime that marks it as semantically ambiguous; semantic interpretations for the sentences
He paints with silent paint and Two pints of the academic liquid!9 that mark them
as semantically anomalous; semantic interpretations for He paints silently and Two pints
of the muddy liquid! that mark them as semantically acceptable; and semantic interpretations
that mark the sentences Eye-doctors eye blonds, Oculists eye blonds, Blonds are
eyed by eye-doctors, etc. as paraphrases of each other but mark Eye-doctors eye what
gentlemen prefer as not a paraphrase of any of these sentences.

To finish describing the conception of a semantic theory developed in SST, we need
only characterize the notions 'semantic interpretation', 'dictionary entry', and 'projection
rule'. However, it is important to point out that our characterization of these
notions must be such that the semantic interpretations, dictionary entries, and projection
rules of a semantic theory of any natural language will be specified formally, i.e.,
the application of the projection rules will be determinable solely on the basis of the
shapes of the symbols in the strings to which they apply, and the operations that the rules
effect upon these strings will be mechanical. The degree to which a semantic theory
is not formally specified is the degree to which the fluent speaker's linguistic knowledge,
or his intelligence and ingenuity, are needed to determine whether or not the projection
rules apply in certain cases and to determine what operations these rules effect
when they apply. Hence, to require that a semantic theory of a natural language be fully
formal is just to require that it fully explicate the speaker's knowledge of the semantic
structure of the language.

Let  us  start  with  the  notion  'dictionary  entry'.  As  we  have  seen  above,  a  semantic 
theory  is  intended  to  reconstruct  the  process  by  which  a  speaker  projects  a  meaning  for 
a sentence from the meanings of the lexical items appearing in it. Thus, within a semantic
theory the dictionary entries play the special role of providing the basis from which
the projection rules of the theory derive the semantic interpretations that they assign
sentences. Our characterization of the notion 'dictionary entry' must thus be such that
in it we have a normal form for the dictionary entries which enables us to represent
lexical information in a manner that is both formal and sufficient in content to provide
a  complete  basis  from  which  the  projection  rules  can  operate. 

For the vast majority of cases,10 a dictionary entry is a set consisting of a finite
number of sequences of symbols, each sequence consisting of an initial subsequence of
syntactic markers, followed by a subsequence of semantic markers, then, optionally, one
distinguisher, and, finally, a selection restriction. Dictionary entries can be represented
in the form of tree diagrams, such as that shown in Fig. XIV-2, in which each sequence
in the entry for a lexical item m_i appears as a distinct path rooted at the lexical item
m_i.11 As illustrated in Fig. XIV-2, semantic markers are represented enclosed within
parentheses, the distinguishers are represented enclosed within brackets, and the selection
restrictions are represented within angles. Each complete path, that is, each sequence,
represents a distinct sense of the lexical item in whose entry it appears. Thus, in
Fig. XIV-2 the lexical item bachelor is represented as having four distinct senses.
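
The normal form for dictionary entries described above can be sketched as a small data structure. The Python below is only an illustration: the class name Path, the field names, and the helper sense_count are hypothetical, and the selection restriction is kept as unparsed text. The two paths for colorful are the ones quoted elsewhere in this report.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Path:
    syntactic: Tuple[str, ...]            # initial subsequence of syntactic markers
    semantic: Tuple[str, ...]             # subsequence of semantic markers
    distinguisher: Optional[str] = None   # at most one distinguisher
    restriction: Optional[str] = None     # selection restriction, kept as text

    def render(self, item):
        """Write the path with parentheses, brackets, and angles as in the text."""
        parts = [item, *self.syntactic, *(f"({m})" for m in self.semantic)]
        if self.distinguisher is not None:
            parts.append(f"[{self.distinguisher}]")
        text = " → ".join(parts)
        if self.restriction is not None:
            text += f" <{self.restriction}>"
        return text


# An entry is a finite set of paths; each path represents one sense.
ENTRIES = {
    "colorful": frozenset({
        Path(("adjective",), ("Color",),
             "abounding in contrast or variety of bright colors",
             "(Physical object) v (Social activity)"),
        Path(("adjective",), ("Evaluative",),
             "of distinctive character, vividness, or picturesqueness",
             "(Aesthetic object) v (Social activity)"),
    }),
}


def sense_count(item):
    """Number of distinct senses, i.e. the number of paths in the entry."""
    return len(ENTRIES[item])


for path in sorted(ENTRIES["colorful"], key=lambda p: p.semantic):
    print(path.render("colorful"))
print(sense_count("colorful"))
```

Using a frozen dataclass makes each path hashable, so an entry can literally be a set of paths, which matches the definition of an entry as a set of sequences.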

Fig. XIV-2. Tree diagram of the dictionary entry for bachelor. Legible node labels:
semantic markers (Not-young), (Never-married), (Animal), (Young); distinguishers
[Knight who is serving under the standard of another], [One having the academic degree
conferred for completing the first four years of college], [Fur seal when without a mate
during breeding time]; selection restrictions <Ω_1>, <Ω_2>, <Ω_3>, <Ω_4>.

Semantic markers are the formal elements that a semantic theory employs to
express semantic relations of a general nature. In contrast, distinguishers are the
formal elements employed to represent that which is idiosyncratic about the meaning
of a lexical item. A distinguisher serves to distinguish a lexical item from those that
are closest to it in meaning. Thus, a semantic marker found in the path of a certain
lexical item will also be found in the paths of many other lexical items throughout the
dictionary, whereas a distinguisher found in the path of a certain lexical item will, with
very few exceptions, not be found anywhere else in the dictionary. This difference can
be more fully appreciated if one compares the consequences of eliminating a semantic
marker from a dictionary with the consequences of eliminating a distinguisher: in the
former case indefinitely many semantic relations between the expressions of the language
which were marked by the eliminated semantic marker would no longer be marked,
whereas in the latter case only the few distinctions in sense which were marked by the
eliminated distinguisher would no longer be marked.12

A  lexical  item  is  ambiguous  if  and  only  if  its  entry  contains  at  least  two  distinct 
paths. It is clear, moreover, that ambiguity at the lexical level is the source of semantic
ambiguity at the sentence level. Thus, a necessary, though not sufficient, condition
for a syntactically unambiguous sentence to be semantically ambiguous is that it contain
an ambiguous lexical item. For example, the source of the semantic ambiguity of the
sentence He likes to wear a light suit in the summer is the lexical ambiguity of the
word light. Since an adequate dictionary entry for a lexical item must mark every one
of its ambiguities, the dictionary entry for light is required to represent this lexical
item as branching into one path containing the semantic marker (Color) but not (Weight)
and another containing the semantic marker (Weight) but not (Color).

However,  an  ambiguous  lexical  item  in  a  syntactically  unambiguous  sentence  is  not 
a  sufficient  condition  for  that  sentence  to  be  semantically  ambiguous.  For  example, 
the  sentence  The  stuff  is  light  enough  to  carry,  though  it  contains  the  ambiguous  lexical 
item  light,  is  not  understood  according  to  the  sense  in  which  light  enough  to  carry  means 
light  enough  in  color  to  be  carried.  Thus,  that  the  occurrence  of  an  ambiguous  lexical 
item  in  a  sentence  does  not  ipso  facto  make  that  sentence  semantically  ambiguous  implies 
that the grammatical relations in the sentence and/or the meanings of the other constituents
prevent this item from bearing more than one of its readings in its role as a
constituent  of  the  sentence.  This  shows  that  selection  of  some  senses  and  exclusion  of 
others  occur  as  a  result  of  the  other  constituents  of  the  sentence.  Such  selection  is  of 
fundamental  importance  because,  together  with  lexical  ambiguity,  it  determines  whether 
or  not  a  sentence  is  anomalous,  whether  a  sentence  is  semantically  unambiguous  or 
semantically  ambiguous,  and  all  other  semantic  properties  of  sentences  which  we  want 
a  semantic  theory  to  mark. 

Therefore, besides containing syntactic markers that determine the part-of-speech
classification for a lexical item, semantic markers that represent the semantic properties
that the lexical item shares with many other lexical items, and (optionally) a
distinguisher that fixes its idiosyncratic features, a path for a lexical item can contain
selection  restrictions  that  determine  the  combinations  into  which  the  lexical  item  can 
enter  and  the  sense(s)  that  it  bears  in  those  combinations.  The  formal  representation 
of  selection  restrictions  can  be  regarded  as  an  explication  of  such  information  as 
The  Shorter  Oxford  English  Dictionary's  qualification  that  the  word  honest  when  applied 
to  persons  means  "of  good  moral  character,  virtuous,  upright"  and  applied  to  women 
is  ambiguous  between  this  sense  and  the  sense  of  chaste.  In  SST  we  wrote: 


For our reconstruction, we shall use left and right angles enclosing a
Boolean function of syntactic or semantic markers. Such configurations
of symbols will be affixed to the terminal element of a path (either the
distinguisher or the last semantic marker if there is no distinguisher)
and will be construed, relative to the projection rules, as providing a
necessary and sufficient condition for a semantically acceptable combination.
Thus, for example, the angle-material affixed to the path of a
modifier determines the applicability of that path of the modifier to a
sense of a nominal head; in particular, a path in the dictionary entry
for honest will be: honest → adjective → (Evaluative) → (Moral) →
[Innocent of illicit sexual intercourse] <(Human) & (Female)>. This is to
be construed as saying that an adjectival occurrence of honest receives
the interpretation (Evaluative) → (Moral) → [Innocent of illicit sexual
intercourse] just in case the head it modifies has a path containing both
the semantic marker (Human) and the semantic marker (Female). How
in actual practice a semantic theory utilizes angle-material to determine
selection-exclusion relations to obtain the correct semantic interpretations
for sentences can only be made clear by a statement of the
projection rules.13

The next notion for us to explain is the notion 'projection rule'. Let us suppose that
an  English  grammar  provides  a  semantic  theory  with  the  input  sentence  The  boys  like 
candy,  together  with  the  constituent  structure  characterization  as  given  in  Fig.  XIV-1. 
The first step that the theory performs in the process of assigning a semantic interpretation
to this sentence will be to associate with each of the lexical items in the sentence,
i.e., the, boy, s, like, and candy, all and only the paths from their dictionary entries
that  are  compatible  with  the  syntactic  categorization  that  the  lexical  items  are  given  in 
the  constituent  structure  characterization.  This  is  the  first  point  at  which  a  significant 
use  of  grammatical  information  is  made,  but  by  no  means  the  last.  The  semantic  theory 
works as follows: a path in the dictionary entry for the lexical item m_j is assigned to
the finite, non-null set of paths P^i_j which is associated with the occurrence of m_j in the
constituent structure characterization d_i just in case that path contains syntactic
markers that attribute to m_j the same part-of-speech classification that it has on the
constituent structure characterization d_i. Thus, the lexical item m_1 is associated with
the set of paths P^i_1; m_2 is associated with P^i_2, ..., and m_n is associated with P^i_n.
Referring to Fig. XIV-1, we picture the result of this step as converting the diagram
into one in which the is associated with the set of paths P^1_1, boy is associated with P^1_2,
s is associated with P^1_3, like is associated with P^1_4, and candy is associated with P^1_5
(though no other change is made). Thus, for example, P^1_5 contains paths representing
each of the senses that candy has as a noun but none of the paths representing its senses
as a verb, e.g., We will candy fruits tomorrow, The fruits candy easily, etc. This rule
that associates senses with the occurrences of lexical items in constituent structure
characterizations is thus our first projection rule. But since it is in many ways atypical,
we shall continue our discussion of projection rules as if this were not such a rule.
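
The first step just described, which keeps for each occurrence of a lexical item only the paths whose syntactic markers agree with its categorization, can be sketched as a simple filter. The mini-dictionary below is hypothetical: each path is reduced to a (part of speech, gloss) pair, and the glosses are placeholders rather than entries taken from this report.

```python
# Hypothetical entries: (part of speech, gloss). The glosses are placeholders.
DICTIONARY = {
    "like":  [("verb", "find agreeable"),
              ("preposition", "similar to")],
    "candy": [("noun", "confection"),
              ("verb", "preserve by coating with sugar"),
              ("verb", "become crystallized into sugar")],
}


def associate_paths(occurrences):
    """Pair each occurrence with the paths compatible with its category.

    `occurrences` is a list of (lexical item, part of speech) pairs read off
    a constituent structure characterization.
    """
    result = []
    for item, category in occurrences:
        paths = [p for p in DICTIONARY[item] if p[0] == category]
        result.append((item, paths))
    return result


for item, paths in associate_paths([("like", "verb"), ("candy", "noun")]):
    print(item, paths)
```

With candy categorized as a noun, the two verbal paths are excluded, which mirrors the remark about We will candy fruits tomorrow.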

There are two types of projection rules: type 1 projection rules and type 2 projection
rules. The job of type 1 projection rules is to effect a series of amalgamations of
paths,  proceeding  from  the  bottom  to  the  top  of  a  constituent  structure  characterization, 
by  embedding  paths  into  each  other  to  form  a  new  path,  the  amalgam.  The  amalgam  is 
then  assigned  to  the  set  of  paths  associated  with  the  node  that  immediately  dominates 
the  sets  of  paths  from  which  the  amalgamated  paths  were  drawn.  The  amalgam  thus 
provides  one  of  the  ways  to  read  the  sequence  of  lexical  items  that  the  node  dominates. 

In this manner, a set of readings is provided for every sequence of lexical items dominated
by a syntactic marker in the constituent structure characterization, until the
highest syntactic marker 'S' is reached and associated with a set of readings for the
whole sentence. The operation of amalgamation is that of joining with one another one
path from each of the m different sets of paths dominated by a syntactic marker SM to
form a composite path to be a member of the set of paths associated with the node that
the syntactic marker SM labels. The joining of a pair of paths occurs just in case one
of the paths satisfies the selection restrictions that the other contains. If the syntactic
marker SM dominates just the sets of paths P_1, P_2, ..., P_m and P_1 contains k_1 paths,
P_2 contains k_2 paths, ..., P_m contains k_m paths, then the set of paths which is associated
with the dominating marker SM contains at most (k_1 · k_2 · ... · k_m) members
and possibly zero members if selection restrictions prevent every possible amalgamation
from forming. Each path that is in the set assigned to SM is called 'a reading for
the lexical string that SM dominates in the constituent structure characterization d_i'.
The number of readings that is thus allotted to a string of lexical items determines its
degree of semantic ambiguity: a string with no readings is anomalous, a string with
exactly one reading is unambiguous, and a string with two or more readings is semantically
ambiguous two or more ways.
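
The counting argument above can be checked with a short sketch. The code below is an illustration under stated assumptions: paths are placeholder strings, and a hypothetical predicate can_join stands in for the selection restrictions that decide whether a combination of paths may be amalgamated.

```python
from itertools import product
from math import prod


def readings(path_sets, can_join):
    """All amalgams formed by taking one path from each dominated set."""
    return [combo for combo in product(*path_sets) if can_join(combo)]


def degree_of_ambiguity(n_readings):
    """Classify a lexical string by the number of readings allotted to it."""
    if n_readings == 0:
        return "anomalous"
    if n_readings == 1:
        return "unambiguous"
    return f"ambiguous {n_readings} ways"


# Placeholder paths: two dominated sets with 2 and 3 members respectively.
path_sets = [["p1", "p2"], ["q1", "q2", "q3"]]
# Hypothetical combinations ruled out by selection restrictions.
blocked = {("p1", "q3"), ("p2", "q1"), ("p2", "q2"), ("p2", "q3")}


def can_join(combo):
    return combo not in blocked


upper_bound = prod(len(s) for s in path_sets)   # product of the set sizes
found = readings(path_sets, can_join)
print(upper_bound, len(found), degree_of_ambiguity(len(found)))
```

The product of the set sizes is only an upper bound; each blocked combination removes one candidate reading, and blocking all of them leaves the string anomalous.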

An example of a projection rule of type 1 is:

(R1) Given two paths associated with nodes branching from the same node labelled SM,
one of the form

Lexical String_1 → syntactic markers of head → (a_1) → (a_2) → ... → (a_n) → [1] <Ω_1>

and the other of the form

Lexical String_2 → syntactic markers of the modifier of the head → (b_1) → (b_2) → ...
→ (b_m) → [2] <Ω_2>

such that the string of syntactic or semantic markers of the head has a substring
σ which satisfies <Ω_2>, then there is an amalgam of the form

Lexical String_2 + Lexical String_1 → dominating node marker SM → (a_1) → (a_2) → ...
→ (a_n) → (b_1) → (b_2) → ... → (b_m) → [[2][1]] <Ω_1>,

where any b_i is null just in case there is an a_j such that b_i = a_j, and [[2][1]] is
simply [1] just in case [2] = [1]. This amalgam is assigned to the set of paths associated
with the node labelled SM that dominates Lexical String_2 + Lexical String_1.

(R1) explicates the process of attribution, i.e., the process of creating a new semantic
unit compounded from a modifier and head whose semantic properties are those of the
head, except that the meaning of the compound is more determinate than the head's by
virtue of the semantic information contributed by the modifier. The erasure clause at
the end of the statement of (R1) is included to avoid pointlessly duplicating the semantic
markers and distinguishers in the path for a compound expression. The modifier-head
relation will be explicated by the grammar of the language so that the constituent
structure characterization of a sentence will mark all cases of this relation that are
found in it. In English, as well as many other languages, instances of modifier-head
relations are: adjective-noun modification, adverb-verb modification, adverb-adjective
modification,  etc.  Here,  then,  is  another  point  at  which  grammatical  information  is 
utilized  in  the  process  of  assigning  a  semantic  interpretation  to  a  sentence. 

An example of an amalgamation produced by (R1) is the joining of the path colorful
→ adjective → (Color) → [Abounding in contrast or variety of bright colors] <(Physical
object) v (Social activity)> and the path ball → noun → (Physical object) → [Of globular
shape] to produce the new compound path colorful + ball → noun → (Physical object) →
(Color) → [[Abounding in contrast or variety of bright colors][Of globular shape]]. An
example of an amalgamation that is prevented by a selection restriction is that of the
path colorful → adjective → (Evaluative) → [Of distinctive character, vividness, or
picturesqueness] <(Aesthetic object) v (Social activity)> with the path for ball just given
above. This possible amalgamation is precluded because the selection restriction in
the path of the modifier requires that this path be joined only with paths of
heads that contain either the semantic marker (Aesthetic object) or the semantic
marker (Social activity), and this path of ball contains neither one of these
semantic markers.
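
The colorful ball example can be reproduced with a small sketch of the modifier-head rule. This is an illustration under assumptions, not the rule as stated: the function names satisfies and r1 and the dictionary keys are hypothetical, a selection restriction is encoded as a nested tuple, distinguishers are kept in a list so that an amalgam can serve as a head again, and "has a substring that satisfies the restriction" is simplified to membership of markers in a set.

```python
def satisfies(restriction, markers):
    """Evaluate a Boolean function of markers against a set of markers.

    A restriction is None (no restriction), a marker name, or a tuple
    ("and" | "or", sub-restriction, ...).
    """
    if restriction is None:
        return True
    if isinstance(restriction, str):
        return restriction in markers
    op, *args = restriction
    results = [satisfies(arg, markers) for arg in args]
    return all(results) if op == "and" else any(results)


def r1(head, modifier, sm):
    """Amalgamate a modifier path with a head path, or return None."""
    head_markers = set(head["syntactic"]) | set(head["semantic"])
    if not satisfies(modifier["restriction"], head_markers):
        return None
    # Erasure clause: drop modifier material that the head already carries.
    new_markers = [b for b in modifier["semantic"] if b not in head["semantic"]]
    new_dist = [d for d in modifier["distinguishers"]
                if d not in head["distinguishers"]]
    return {
        "string": modifier["string"] + " + " + head["string"],
        "syntactic": [sm],
        "semantic": head["semantic"] + new_markers,
        "distinguishers": new_dist + head["distinguishers"],
        "restriction": head["restriction"],
    }


ball = {"string": "ball", "syntactic": ["noun"],
        "semantic": ["Physical object"],
        "distinguishers": ["Of globular shape"], "restriction": None}
colorful_color = {"string": "colorful", "syntactic": ["adjective"],
                  "semantic": ["Color"],
                  "distinguishers": ["Abounding in contrast or variety of bright colors"],
                  "restriction": ("or", "Physical object", "Social activity")}
colorful_evaluative = {"string": "colorful", "syntactic": ["adjective"],
                       "semantic": ["Evaluative"],
                       "distinguishers": ["Of distinctive character, vividness, or picturesqueness"],
                       "restriction": ("or", "Aesthetic object", "Social activity")}

print(r1(ball, colorful_color, "noun"))
print(r1(ball, colorful_evaluative, "noun"))
```

The first call yields the compound path for colorful + ball; the second returns None because the path of ball contains neither marker that the evaluative sense requires.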

Type  2  rules  work  differently  and  are  best  explained  after  we  explain  the 
concept  of  a  semantic  interpretation  of  a  sentence. 

A semantic theory receives more than one constituent structure characterization
for a sentence if that sentence is syntactically ambiguous. Figures XIV-3
and XIV-4 show the two constituent structure characterizations for the syntactically
ambiguous sentence I like little boys and girls.


Fig.  XIV-3. 


Fig. XIV-4.

Let d_1, d_2, ..., d_n (for n ≥ 1) be the constituent structure characterizations that the
grammar provides for the sentence S. We shall define the "semantic interpretation
of S" to be (1) the conjunction ψd_1 & ψd_2 & ... & ψd_n of the semantic interpretations of
the n constituent structure characterizations of S, and (2) the statements about S that
follow from the definition schema:

(D) S is fully X if and only if S is X on every d_i.

The semantic interpretation ψd_i of the constituent structure characterization d_i of S is
(1) the constituent structure characterization d_i each node of which is associated with
its full set of readings, i.e., every reading that can belong to the set on the basis of the
dictionary entries and the projection rules does belong to it, and (2) the statements
about S that follow from (1) together with the definitions:

(D1) S is semantically anomalous on d_i if and only if the set of paths associated with
the node labelled 'S' in d_i contains no members.

(D2) S is semantically unambiguous on d_i if and only if the set of paths associated with
the node labelled 'S' in d_i contains exactly one member.

(D3) S is n-ways semantically ambiguous on d_i if and only if the set of paths associated
with the node labelled 'S' in d_i contains exactly n members (n ≥ 2).

(D4) S_1 and S_2 are paraphrases on a reading with respect to their characterizations d_i
and d_j if and only if the set of paths associated with the node labelled 'S' in d_i and
the set of paths associated with the node labelled 'S' in d_j have a reading in common.

(D5) S_1 and S_2 are full paraphrases with respect to their characterizations d_i and d_j if
and only if the set of paths associated with the node labelled 'S' in d_i and the set of
paths associated with the node labelled 'S' in d_j have exactly the same membership.

Since  these  definitions  are  self-explanatory,  we  can  now  return  to  our  account  of  the 
projection  rules  and  explain  the  concept  of  a  type  2  projection  rule. 
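
The five definitions and the "fully X" schema translate almost directly into set operations. The sketch below assumes that the set of paths associated with the node labelled 'S' is given as a Python set of hashable readings; the function names and the placeholder readings are hypothetical.

```python
def anomalous(readings):
    """First definition: no readings at the node labelled 'S'."""
    return len(readings) == 0


def unambiguous(readings):
    """Second definition: exactly one reading."""
    return len(readings) == 1


def ways_ambiguous(readings):
    """Third definition: the number n of readings when n >= 2, else None."""
    n = len(readings)
    return n if n >= 2 else None


def paraphrases_on_a_reading(readings_1, readings_2):
    """Fourth definition: the two sets have a reading in common."""
    return bool(set(readings_1) & set(readings_2))


def full_paraphrases(readings_1, readings_2):
    """Fifth definition: the two sets have exactly the same membership."""
    return set(readings_1) == set(readings_2)


def fully(predicate, reading_sets):
    """Definition schema: S is fully X iff S is X on every characterization."""
    return all(predicate(readings) for readings in reading_sets)


# Placeholder readings for two sentences, one characterization each.
s1 = {"reading_a", "reading_b"}
s2 = {"reading_a"}

print(ways_ambiguous(s1), unambiguous(s2))
print(paraphrases_on_a_reading(s1, s2), full_paraphrases(s1, s2))
print(fully(anomalous, [set(), set()]))
```

Here the two sentences are paraphrases on a reading but not full paraphrases, since the first has a reading that the second lacks.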

A grammar employs two types of syntactic rules to achieve its aim of assigning the
correct constituent structure characterization to each sentence of the language.16 The
first type are rules that develop single symbols by rewriting operations that are
restricted to scanning the linear context of the symbol to be developed for information
used to determine the rule's applicability. Such rules construct constituent structure
characterizations such as those in Figs. XIV-1, XIV-3, and XIV-4 in somewhat the
following manner: the first rule rewrites the initial symbol S (standing for "sentence")
as NP+VP (which categorizes a noun phrase-verb phrase sequence as a sentence), then
a rule can be used to rewrite NP as either NP_sing or NP_pl, then other rules can be
used to rewrite VP as either V+NP or V_intr or be+Pred, still other rules to rewrite
NP_sing as T+N, T as the, N as boy (or man, coat, mouse, etc.), and so forth.17 But the
scanning limitation on these rules makes the second type of rule necessary, for it has
been shown that a grammar can assign constituent structure characterizations correctly
only if some of its rules use information about the derivational history of sentences in
order to determine their applicability.18 Thus, in addition to such rewrite rules, grammars
contain what are called "transformational rules," rules that operate on entire
constituent structure characterizations, or any of their parts, and map labelled trees
onto labelled trees, thus transforming simpler sentences into more complex ones and
assigning the transformed sentence a constituent structure characterization. Transformational
rules perform the task of explicating the syntactic relations between sentence
types in the language. Thus, they show the syntactic relation between such pairs as
The boys play games and Games are played by the boys, The men fly planes and What
do the men fly?, etc., by showing how the latter member of each pair is constructed
out of the former.

Type 2 projection rules are intended to explicate the manner in which transformational
rules preserve or change meaning. It has often been observed by linguists who work
with transformational rules that the sentence resulting from the application of a
transformational rule to a set of source sentences is related in meaning to these source
sentences in a definite, systematic way.19 The employment of type 2 rules is intended to
reveal the facts of language that underlie this observation.

We can characterize a type 2 projection rule as a rule that produces a semantic
interpretation ψd_i for the constituent structure characterization d_i that has been
constructed by the operation of the transformation T out of the set of constituent
structure characterizations d_1, d_2, ..., d_n. A type 2 projection rule operates on the set
of semantic interpretations ψd_1, ψd_2, ..., ψd_n and the transformation T to produce
the semantic interpretation ψd_i. Type 2 projection rules should assign semantic
interpretations in such a way as to reconstruct the manner in which the meaning of the
sentence that was constructed by T is a function of the meanings of each of the sentences
that were used by T in its construction.

J. J. Katz

Footnotes

1. A full discussion of the nature of a semantic meta-theory will be found in
J. J. Katz and J. A. Fodor, The structure of a semantic theory (to be published in
Language); Readings in the Philosophy of Language, edited by J. A. Fodor and
J. J. Katz (Prentice-Hall, Inc., Englewood Cliffs, N. J., in press).

2. This is not meant to imply that the conception of a semantic theory which
is outlined here conceives of such a theory as the product of a discovery procedure.


3. Here, I anticipate some such objection as the following: "How can you
say a fluent speaker 'knows' something if he cannot say what it is that you
claim he knows?" I do not think anything hangs on my having the word "know,"
and so if anyone insists upon this objection, I shall give up the word rather
than become embroiled in a lexical quibble. I intend to convey the idea that
the fluent speaker has acquired the necessary means for performing a task whose
character compels us to admit that its performance results from the application
of rules. Among the reasons that compel us to make this admission is
that cited by Miller, Pribram, and Galanter, viz., that the task of understanding
any twenty-word sentence is one that a fluent speaker can perform, yet the number
of twenty-word sentences is 10^30, while the number of seconds in a century
is only 3.15 × 10^9. Cf. G. A. Miller, K. Pribram, and E. Galanter, Plans
and the Structure of Behavior (Henry Holt and Company, New York, 1960), pp. 146-147.

4. I say "in principle" because, in practice, limitations of perception, memory,
mortality, etc., prevent the speaker from applying his knowledge of the
rules of the language to provide himself with the meaning of certain sentences.
This situation is exactly analogous to the case of a person's knowledge of the
rules of arithmetic computation. Knowing how to perform any computation and
knowing the rules of arithmetic computation are not sufficient to enable someone
to actually perform any (specific) computation; for, again, limitations of
perception, memory, mortality, etc., stand in the way.

5.  Such  information  will  be  needed  to  provide  the  difference  upon  which  rests  the 
distinction  in  meaning  between  sentences  composed  of  exactly  the  same  morphemes, 
e.g. ,  Gourmets  do  approve  of  people  eating,  Gourmets  do  approve  of  eating  people, 

Do  gourmets  approve  of  people  eating?,  and  so  on.  - 

6.  N.  Chomsky,  Syntactic  Structures  (Mouton  and  Company,  'S-Gravenhage,  Second 
Printing,  1962),  Chapter  4.  In  general,  we  shall  follow  Chomsky's  conception  of  syntax. 

7. I shall use the notational abbreviations: "NP" for a noun phrase, "VP" for a verb
phrase, "N" for a noun, "V" for a verb, "T" for an article, "A" for an adjective, "C"
for a co-ordinating conjunction, and the subscript symbols "sing," "pl," and "tr" for the
syntactic properties of nominal singularness, nominal pluralness, and verbal transitivity,
respectively.

8.  In  particular,  an  optimal  grammar  will  include  a  specification  of  the  transfor¬ 
mational  history  for  each  sentence.  Cf.  N.  Chomsky,  op.  cit. 

9. For the first of these two examples I am indebted to Professor Uriel Weinreich,
Columbia  University,  and  for  the  second  to  Professor  George  A.  Miller,  Center  for 
Cognitive  Studies,  Harvard  University. 

10.  In  a  small  minority  of  cases,  dictionary  entries  consist  of  instructions,  e.g., 
the  rules  for  not  that  are  given  in  J.  J.  Katz,  Analyticity  and  contradiction  in  natural 
language, Readings in the Philosophy of Language, op. cit. For a further discussion of
the type of entry found in the vast majority of cases see J. J. Katz and J. A. Fodor,
op. cit., Section 6.
11. Two comments on Fig. XIV-2. First, the word bachelor, though a noun, can
select and exclude other nouns in various types of constructions, e.g., in noun-noun cases
such as He is my bachelor friend, or in noun-in-apposition cases such as Mr. Smith,
the neighborhood bachelor, is here. We must therefore represent bachelor as having a
selection restriction for each sense; accordingly, the terminal element for each path in
Fig. XIV-2 is a selection restriction enclosed in angles. Second, the particular selection
restrictions are omitted because their inclusion would only complicate matters
unnecessarily at this point.

12.  For  a  further  examination  of  the  distinction  between  the  notions  'seman¬ 
tic  marker'  and  'distinguisher',  see  J.  J.  Katz  and  J.  A.  Fodor,  op.  cit. , 
Section  6. 

13.  Ibid.,  loc.  cit. 

14.  For  a  full  discussion  of  this  step  see  the  treatment  of  rule  (I)  in  Katz  and  Fodor, 
ibid. ,  Section  7. 

15.  Thus,  for  example,  it  makes  no  sense  to  include  the  semantic  markers  (Human) 
and  (Female)  twice  in  the  path  associated  with  the  compound  spinster  aunt  because  both 
of  the  constituent  paths  contain  occurrences  of  both.  The  second  occurrence  of  (Human) 
or  (Female)  would  provide  no  semantic  information  whatever. 

16. Although this is not the only aim of a grammar. Cf. N. Chomsky, On the notion
'rule of grammar', Proc. Symposia in Applied Mathematics, Vol. 12, The Structure of
Language and Its Mathematical Aspects, edited by R. Jakobson (American Mathematical
Society, New York, 1961), pp. 6-24.

17. Cf. N. Chomsky, A transformational approach to syntax, Third Texas Conference
on Problems of Linguistic Analysis of English, edited by A. A. Hill (University of Texas,
1962), pp. 124-158.

18. N. Chomsky, Syntactic Structures, op. cit.; On the notion 'rule of grammar',
op. cit.; The Logical Structure of Linguistic Theory, 1955 (microfilm available in
Hayden Library, M.I.T.). Cf. P. Postal, Some further limitations of phrase-structure
grammars, Readings in the Philosophy of Language, op. cit.

19. This point has been discussed outside the context of the conception of a semantic
theory adopted in the present report in two recent papers: J. A. Fodor, Projection and
paraphrase in semantics, Analysis 21, 73-77 (1961); J. J. Katz, A reply to 'Projection
and Paraphrase in Semantics', Analysis 22, 36-41 (1961).


B.  REMARKS  ON  THE  MORPHOPHONEMIC  COMPONENT  OF  RUSSIAN 

In this report we present an entirely new synchronic analysis of the morphophonemic
component of the contemporary standard literary dialect of Russian. Although
our analysis is original, we nevertheless draw the reader's attention to the general
similarity of the rules that we propose here and the rules already proposed by previous
investigators.1 The originality of our analysis, therefore, consists primarily in the
underlying (phonemic) structure which we postulate for Russian.

We consider here only verb forms, and in the rules presented below we shall not
dwell on particularities of pronunciation which have been adequately described
elsewhere.2

We  shall  be  concerned  in  particular  with  the  following  forms: 
[Table of verb forms not legible in the source.]

One of the Morpheme-Structure Rules that we postulate for Russian states that no
terminal vocabulary item may end in a vowel. This rule will convert terminal
vocabulary items like /znōi/ and /gīu/ to /znōj/ and /gīw/. For the purposes of the present
report we may state this MS-Rule as follows:

MS-1   [-cons] → [-voc] in env: ___ +

Note that because of this rule we may use archiphonemes specified solely for the
features consonantal and gravity in terminal vocabulary items like /znōi/ and /gīu/, etc.
In the phonemic forms presented above we have used the following vowel system:

segment:   ū   ī   ō   ē   u   i   o   e
tense:     +   +   +   +   -   -   -   -
diffuse:   +   +   -   -   +   +   -   -
grave:     +   -   +   -   +   -   +   -

We use consonant letters as abbreviations for the appropriate distinctive feature
matrices. We draw attention, however, to the fact that in the forms presented above
palatalization (sharping) of consonants is completely predictable (this fact may be
formally presented in a Morpheme-Structure Rule to the effect that all consonants, with
some limitations which we have discussed elsewhere, are specified non-sharp).

We require application of the following rules to the phonemic representation in order
to derive the correct phonetic representations. C-Rules apply cyclically to segments
within innermost parentheses; P-Rules apply to forms derived from the C-Rules.

C-1   Insert j in env: ___ + [+voc, -cns, +tns] + [+voc, -cns, -tns]

C-2   u → w in env: ___ + [+voc, -cns]

C-3   V → Ø in env: ___ + [+voc, -cns]

C-4   Erase parentheses and return to C-1. If there are no more parentheses, then
      go to P-1.


P-1   [+obs, +cmp] → [+str, -grv] in env: ___ [+voc, -cns, -grv] X, where X is not + #.

P-2   [+cons] → [+sharp] in env: ___ [-cns, -grv]

P-3   ī → Ø in env: X + (t,) ___ + #,
      where X contains a stressed vowel and does not end in two consonants.

P-4   Transitive softening (i.e., s,j → š, d,j → ž, b,j → bl,, k,j → č,
      etc., and j → Ø in env: C ___).

P-5   e → o in env: ___ [+cons, -shrp]

P-6   [ ] → [-cmp, -flt]
P-7   [-tns, +grv] → [+flt]
P-8   [-dif] → [+cmp]
      (P-6 through P-8 apply in env: [___, +voc, -cns])

P-9   ei → ie

P-10  V → Ø in env: ___ V

P-11  w → v (environment not legible in the source)

P-12  (rule not legible in the source)

P-13  æ → a

P-14  Vowel reduction; raising of [ε] to [e] before soft consonants; backing of [i] to
      [ɨ] after hard consonants.

P-15  [-voc, -cns] → Ø in env: ___ + [+cons]

P-16  (left-hand side not legible in the source) in env: ___ + l

P-17  l → Ø in env: C + ___

P-18  Erase all boundary and juncture markers.


P-Rules  6-8  apply  to  all  vowels  and  specify  features  absent  in  the  matrix  presented 
above.  Note  that  after  application  of  P-Rules  6-8  the  vowel  segments  presented  above 
will  have  the  following  distinctive  feature  matrices: 


segment:   ū   ī   ō   ē   u   i   o   e
tense:     +   +   +   +   -   -   -   -
diffuse:   +   +   -   -   +   +   -   -
grave:     +   -   +   -   +   -   +   -
compact:   -   -   +   +   -   -   +   +
flat:      -   -   -   -   +   -   +   -


Inspection of this matrix will reveal that application of P-Rules 6-8 has the following
effect (symbols to the left of the arrow are those of the matrices above; symbols to the
right of the arrow are those of the International Phonetic Alphabet):

ū → ɨ      u → ʊ
ī → i      i → ɪ
ō → a      o → ɔ
ē → æ      e → ε
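
The way P-Rules 6-8 fill in the redundant features can be checked mechanically. The following is only a sketch under stated assumptions: it reads P-6 as "all vowels become non-compact and non-flat", P-7 as "lax grave vowels become flat", and P-8 as "non-diffuse vowels become compact", and it writes tense vowels in upper case and lax vowels in lower case (an ASCII convention introduced here, not the report's notation).

```python
# Phonemic specification of the eight vowels: (tense, diffuse, grave).
# Upper case = tense, lower case = lax (an assumption of this sketch).
PHONEMIC = {
    "U": ("+", "+", "+"), "I": ("+", "+", "-"),
    "O": ("+", "-", "+"), "E": ("+", "-", "-"),
    "u": ("-", "+", "+"), "i": ("-", "+", "-"),
    "o": ("-", "-", "+"), "e": ("-", "-", "-"),
}

def specify(tns, dif, grv):
    """Apply P-6, P-7, P-8 (as read above) to one vowel."""
    cmp_, flt = "-", "-"                 # P-6: every vowel gets [-cmp, -flt]
    if tns == "-" and grv == "+":        # P-7: lax grave vowels become [+flt]
        flt = "+"
    if dif == "-":                       # P-8: non-diffuse vowels become [+cmp]
        cmp_ = "+"
    return {"tns": tns, "dif": dif, "grv": grv, "cmp": cmp_, "flt": flt}

# Fully specified matrix, one entry per vowel.
FULL = {seg: specify(*feats) for seg, feats in PHONEMIC.items()}

for seg, feats in FULL.items():
    print(seg, feats)
```

Under this reading the tense non-diffuse vowels come out compact and non-flat, which agrees with their appearing as [a] and [æ] in the derivations, while the lax grave vowels are the only flat (rounded) ones.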


We  apply  these  rules  to  some  of  the  phonemic  forms  presented  above: 

1a: (nes+tī) -C-4→ nes+tī -P-2→ n,es+t,ī -P-14→ n,ɪs+t,i -P-18→ n,ɪst,i

1b: ((nes+é)+t) -C-4→ (nes+é+t) -C-4→ nes+é+t -P-2→ n,es,+é+t -P-5→
    n,es,+ó+t -P-6, P-7, P-8→ n,εs,+ó+t -P-14→ n,ɪs,+ó+t -P-18→ n,ɪs,ót

1c: (nés+l) -C-4→ nés+l -P-2→ n,és+l -P-5→ n,ós+l -P-6, P-7, P-8→
    n,ɔ́s+l -P-17→ n,ɔ́s

2a: (ték+tī) -C-4→ ték+tī -P-2→ t,ék+t,ī -P-3→ t,ék+t, -P-4→ t,éč,
    -P-6, P-8, P-14→ t,éč,

2b: ((tek+é)+t) -C-4→ (tek+é+t) -C-4→ tek+é+t -P-1→ teč+é+t -P-2→ t,eč,+é+t
    -P-5→ t,eč,+ó+t -P-6, P-7, P-8, P-14→ t,ɪč,+ó+t -P-18→ t,ɪč,ót

2c: (((tek+e)+ī+#)+te) -C-4→ ((tek+e+ī+#)+te) -C-3→ ((tek+ī+#)+te) -C-4→
    (tek+ī+#+te) -C-4→ tek+ī+#+te -P-2→ t,ek,+ī+#+t,e -P-6, P-8, P-14, P-18→
    t,ɪk,it,ɪ

3a: ((leub+ī+ī)+ou) -C-3→ ((leub+ī)+ou) -C-4→ (leub+ī+ou) -C-1→ (leubj+ī+ou)
    -C-3→ (leubj+ou) -C-4→ leubj+ou -P-2→ l,eub,j+ou -P-4→ l,eubl,+ou
    -P-10→ l,ubl,+u -P-18→ l,ubl,u

4a: ((plōk+ō+e)+t) -C-1→ ((plōkj+ō+e)+t) -C-3→ ((plōkj+e)+t) -C-4→ (plōkj+e+t)
    -C-4→ plōkj+e+t -P-2→ plōk,j+e+t -P-4→ plōč,+e+t -P-5→ plōč,+o+t
    -P-6, P-7, P-8→ plač,+o+t -P-14→ plač,+ɪ+t -P-18→ plač,ɪt

4b: (((plōk+ō+e)+ī+#)+te) -C-1→ (((plōkj+ō+e)+ī+#)+te) -C-3→ (((plōkj+e)+ī+#)+te)
    -C-4→ ((plōkj+e+ī+#)+te) -C-3→ ((plōkj+ī+#)+te) -C-4→ (plōkj+ī+#+te) -C-4→
    plōkj+ī+#+te -P-2→ plōk,j+ī+#+t,e -P-3→ plōk,j+#+t,e -P-4→ plōč,+#+t,e
    -P-6, P-8, P-14, P-18→ plač,t,ɪ

5a: (leg+ē+l) -C-4→ leg+ē+l -P-1→ leǰ+ē+l -P-2→ l,eǰ,+ē+l -P-6, P-8→
    l,εǰ,+æ+l -P-12→ l,εž+æ+l -P-13→ l,εž+a+l -P-14→ l,ɪž+a+l -P-18→
    l,ɪžal

6a: (znōj+tī) -C-4→ znōj+tī -P-2→ znōj+t,ī -P-3→ znōj+t, -P-6, P-8→
    znaj+t, -P-15→ zna+t, -P-18→ znat,

6b: (((znōj+e)+ī+#)+te) -C-4→ ((znōj+e+ī+#)+te) -C-3→ ((znōj+ī+#)+te) -C-4→
    (znōj+ī+#+te) -C-4→ znōj+ī+#+te -P-2→ znōj+ī+#+t,e -P-3→ znōj+#+t,e
    -P-6, P-8→ znaj+#+t,e -P-14→ znaj+#+t,ɪ -P-18→ znajt,ɪ

8a: ((obrōz+ou+ō+e)+ou) -C-1→ ((obrōz+ouj+ō+e)+ou) -C-3→ ((obrōz+ouj+e)+ou)
    -C-4→ (obrōz+ouj+e+ou) -C-3→ (obrōz+ouj+ou) -C-4→ obrōz+ouj+ou -P-6,
    P-7, P-8→ obraz+ouj+ou -P-10→ obraz+uj+u -P-14→ abraz+uj+u -P-18→
    abrazuju

8b: (obrōz+ou+ō+l) -C-2→ (obrōz+ow+ō+l) -C-4→ obrōz+ow+ō+l -P-6, P-7, P-8→
    obraz+ow+a+l -P-11→ obraz+ov+a+l -P-14→ abraz+av+a+l -P-18→ abrazaval

9a: (glēd+ei+l) -C-4→ glēd+ei+l -P-2→ gl,ēd,+ei+l -P-6, P-8→ gl,æd,+ei+l
    -P-9→ gl,æd,+ie+l -P-10→ gl,æd,+e+l -P-14→ gl,ɪd,+e+l -P-18→ gl,ɪd,el
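
The cyclic part of these derivations (rules C-1 to C-4) can be sketched as a small program. This is an illustration under explicit assumptions, not the author's formalization: forms are written in ASCII with upper-case letters for tense vowels and lower-case letters for lax vowels; C-1 is read as inserting j before a tense vowel that is followed by a lax vowel; C-3 is read as deleting a vowel together with its boundary before another vowel; the function names are hypothetical.

```python
import re

LAX = "uioe"      # lax vowels (lower case; assumption of this sketch)
TENSE = "UIOE"    # tense vowels (upper case; assumption of this sketch)
V = LAX + TENSE

def c1(s):
    # C-1: insert j in env ___ + [tense vowel] + [lax vowel]
    return re.sub(rf"(?=\+[{TENSE}]\+[{LAX}])", "j", s)

def c2(s):
    # C-2: u -> w in env ___ + vowel
    return re.sub(rf"u(?=\+[{V}])", "w", s)

def c3(s):
    # C-3: delete a vowel (and the boundary after it) in env ___ + vowel
    return re.sub(rf"[{V}]\+(?=[{V}])", "", s)

def cycle(form):
    """Apply C-1..C-3 inside the innermost parentheses, erase them (C-4), repeat."""
    trace = [form]
    while True:
        m = re.search(r"\(([^()]*)\)", form)   # an innermost constituent
        if m is None:
            return trace                        # no parentheses left: go on to the P-Rules
        inner = c3(c2(c1(m.group(1))))
        form = form[:m.start()] + inner + form[m.end():]
        trace.append(form)

print(cycle("(((tek+e)+I+#)+te)"))
print(cycle("((leub+I+I)+ou)"))
```

With these readings the sketch reaches the same post-cyclic strings as derivations 2c, 3a, 4a, 8a, and 8b (tek+I+#+te, leubj+ou, plOkj+e+t, obrOz+ouj+ou, obrOz+ow+O+l); the P-Rules are not modeled.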

T.  M.  Lightner 
formulated by Halle in his "Note on cyclically ordered rules in the Russian conjugation"
will not satisfy us because we do not consider ždat', lgat', brat', etc. to be formed from
nonsyllabic roots, but rather from the roots /gid/, /lug/, /bir/, etc. In order to
prevent application of C-1 to these verbs, we intend to follow some variant of Halle's most
recent suggestion that this information be listed in the Complex Symbol for
monosyllabic roots containing the archiphoneme {i, u}. Note that we require the presence of
this vowel not only in order to account for the derived imperfectives vyžidat', oblygat',
vybirat', etc., but also to account for the derived nominals lož', vybor, etc. (we defer
treatment of imperfective derivation and nominalization for a later report). We shall
postulate a special marker in the Complex Symbol to signal retention of these lax,
diffuse vowels in the present tense of brat' (beru), zvat' (zovu), drat' (deru), stlat'
(stelju).

We require further that the verb sosat' (sosu, sosët, etc.) be derived from the root
/sus/, not only to account for the lack of application of C-1 but also to account for the
dialectal forms ssu, ssët, etc. See A. G. Preobraženskij, Ètimologičeskij slovar'
russkago jazyka (Columbia University Press, New York, 1951), II, 360, and Max Vasmer,
Russisches etymologisches Wörterbuch (Carl Winter, Universitätsverlag, Heidelberg;
remainder of citation not legible in the source).


C. TYPE 1 GRAMMARS AND LINEAR-BOUNDED AUTOMATA

Recently, Landweber1 showed that the language accepted by a deterministic
linear-bounded automaton, in the sense of Myhill,2 can be generated by a type 1 grammar, in
the sense of Chomsky.3 Landweber's proof remains valid for a nondeterministic linear-
bounded automaton. As the converse of Landweber's theorem, we have

THEOREM: The language generated by a type 1 grammar is accepted by a
nondeterministic linear-bounded automaton.

Thus, we now have equivalent hierarchies: Turing machines, linear-bounded
automata, pushdown storage, and finite automata on the one hand, and semi-Thue systems,
type 1 grammars, type 2 grammars, and type 3 grammars, on the other. (For the
first pair, see, for example, Davis4; for the last two pairs, Chomsky.3,5)

The proof of the theorem will consist of three lemmas. Before stating the lemmas,
we shall define a few notions. According to Chomsky,3 a grammar is type 1 if each
rule is of the following shape:

φAψ → φωψ,   l(A) = 1,   l(ω) ≠ 0.

Here, l(φ) means the length of φ. We generalize his notion a bit, and understand a
type 1 grammar to be a semi-Thue system in which each of its rules φ → φ' satisfies
l(φ) ≤ l(φ'). If, furthermore, each rule satisfies either (a) l(φ) = l(φ') and φ' does not
contain the S-symbol S, or (b) φ = S, then the grammar is called length-preserving. On the
other hand, we define the order of a grammar to be the maximum of the lengths of the
strings appearing in the rules. If an order 2 grammar is length-preserving and if S → EF
implies that E = S, it is said to be linear-bounded. Then we have the following lemmas.

LEMMA 1: For any type 1 grammar G, there exists an order 2 type 1 grammar
G' equivalent to G: L(G) = L(G').

Here, L(G) means the language generated by G. Notice that, since an order 2 type 1
(in the above-mentioned sense) grammar is easily seen to be equivalent to an order 2
grammar that is type 1 in Chomsky's sense, our notion of type 1 grammar turns out to
be equivalent to the notion of type 1 grammar in Chomsky's sense, as far as the
equivalence of grammars is concerned.

LEMMA 2: For any order 2 type 1 grammar G', there exists a linear-bounded
grammar G'' equivalent to G': L(G') = L(G'').

LEMMA  3:  For  any  linear-bounded  grammar  G",  there  exists  a  nondeterministic 
linear-bounded  automaton  that  accepts  L(G"). 

Our  theorem  follows  directly  from  these  lemmas. 
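
The definitions used in the lemmas (generalized type 1, order, length-preserving, linear-bounded) are simple enough to state as executable checks. The sketch below is illustrative only: the representation of a rule as a pair of tuples of symbols, the function names, and the two small example grammars are assumptions introduced here, not part of the report.

```python
def is_type1(rules):
    """Generalized type 1: every rule lhs -> rhs has 1 <= l(lhs) <= l(rhs)."""
    return all(1 <= len(lhs) <= len(rhs) for lhs, rhs in rules)

def order(rules):
    """Maximum length of any string appearing in the rules."""
    return max(max(len(lhs), len(rhs)) for lhs, rhs in rules)

def is_length_preserving(rules, start="S"):
    """Each rule has (a) equal lengths and no start symbol on the right, or (b) lhs = S."""
    return all(
        lhs == (start,) or (len(lhs) == len(rhs) and start not in rhs)
        for lhs, rhs in rules
    )

def is_linear_bounded(rules, start="S"):
    """Order 2, length-preserving, and S -> EF implies E = S."""
    s_rules_ok = all(
        rhs[0] == start
        for lhs, rhs in rules
        if lhs == (start,) and len(rhs) == 2
    )
    return (is_type1(rules) and order(rules) == 2
            and is_length_preserving(rules, start) and s_rules_ok)

# Hypothetical example grammars (not from the report).
G1 = [(("S",), ("S", "B")), (("S",), ("A",)),
      (("A", "B"), ("B", "A")), (("A",), ("a",)), (("B",), ("b",))]
G2 = [(("S",), ("A", "B")), (("A",), ("a",)), (("B",), ("b",))]

print(is_linear_bounded(G1), is_linear_bounded(G2))
```

Here G1 satisfies all three conditions, whereas G2 is length-preserving and of order 2 but fails the condition on S → EF because its only two-symbol expansion of S does not begin with S.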

S.  -Y.  Kuroda 
D. FORMAL JUSTIFICATION OF VARIABLES IN PHONEMIC CROSS-CLASSIFYING
SYSTEMS


Variables in phonemic rules have proved extremely useful.1-3 There is
also a clear formal necessity that variables be included in any system with
cross-classificatory features (whether or not the features are binary). If variables were not
cost-free with respect to a simplicity criterion, the segments [+compact], [+grave]
would appear to be related to each other in a more fundamental sense than the segments
[-compact], [+grave]. But it is only an arbitrary decision to measure frontness in terms
of gravity. '+grave' is exactly equivalent to '-acute'; the value of the polarity of a given
feature quality is not a substantive part of the theory. Therefore the metatheory requires
that the features in a cross-classifying system all be marked:

[OαF]     F = grave, compact     (by convention, the identity operation e is
                                  left blank)

with the operations    ~+ = -     e+ = +
                       ~- = +     e- = -

In phonemic matrices and nonvariable rules the value of O is usually affirmative,
and α is specified with '+' or '-', but this is purely a convention; it would be exactly
equivalent to maintain generally the value of α = + and specify the phonemic matrices
by the symbols '~' or 'e'.

The availability of different operators clearly shows that the segments [+compact],
[-grave] are in as close a relation as [+compact], [+grave]. For instance, the
assimilation rules

[ ] → [+comp] in the env ___ [+grave]
[ ] → [-comp] in the env ___ [-grave]

are more simply combined

A) [ ] → [αcomp] in the env ___ [αgrave]

and the assimilation rules

[ ] → [-comp] in the env ___ [+grave]
[ ] → [+comp] in the env ___ [-grave]

are combined

B) [ ] → [~αcomp] in the env ___ [αgrave].

If the front quality of segments were marked with acuteness instead of gravity, rules A)
and B) would be equally simple: only the polarity is changed

A') [ ] → [~αcomp] in the env ___ [αacute]

B') [ ] → [αcomp] in the env ___ [αacute].

The arbitrariness of the polarity values is represented by the existence of cost-free
variables that make [+X] = [~+X'], where X and X' are opposite extreme values along
the same feature continuum (e.g. {acute, grave; abstract, concrete}).
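
The claim that A) and A') (and likewise B) and B')) describe the same mapping can be checked by brute force over the two values of α. The sketch below is a minimal illustration; the dictionary representation of segments and the helper names are assumptions made here, and 'acute' is simply taken to be the opposite polarity of 'grave'.

```python
def flip(value):
    """The operator ~ : exchanges '+' and '-'."""
    return "-" if value == "+" else "+"

def make_rule(env_feature, negate):
    """Build a rule [ ] -> [(~)alpha comp] in the env ___ [alpha env_feature]."""
    def apply(segment, following):
        alpha = following[env_feature]          # the variable is bound by the environment
        changed = dict(segment)
        changed["comp"] = flip(alpha) if negate else alpha
        return changed
    return apply

rule_A = make_rule("grave", negate=False)        # A)  [alpha comp]  / ___ [alpha grave]
rule_B = make_rule("grave", negate=True)         # B)  [~alpha comp] / ___ [alpha grave]
rule_A_prime = make_rule("acute", negate=True)   # A') [~alpha comp] / ___ [alpha acute]
rule_B_prime = make_rule("acute", negate=False)  # B') [alpha comp]  / ___ [alpha acute]

def context(grave):
    """A following segment specified on both polarities of the same continuum."""
    return {"grave": grave, "acute": flip(grave)}

for g in "+-":
    print(g, rule_A({}, context(g)), rule_A_prime({}, context(g)))
```

Each primed rule uses exactly one more or one fewer operator than its unprimed counterpart and nothing else changes, which is the sense in which the choice of polarity carries no cost.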

In phonology the use of variables over + and - and operators covers various
phenomena:

assimilation
    [ ] → [αX] in the env ___ [αX]

dissimilation
    [ ] → [~αX] in the env ___ [αX]

internal assimilation
    [αX] → [αY]

internal dissimilation
    [αX] → [~αY]

external environment specification
    [ ] → [α] / ___ [αX] [~αY]

exchange (internal dissimilation)
    [αX] → [~αX]

If the exchange rule applies in a transformational cycle, the net effect is one of
reciprocation with respect to the feature X. If the affected segment is contained within
an odd number of constituents to which the exchange rule applies, the net effect is
dissimilative; if it applies an even number of times, there is no net effect. (Sections XIV-E,
XIV-F, and XIV-G present the use of such a rule in the Indo-European e/o ablaut.)

T.  G.  Bever 
E. THE RECIPROCATING CYCLE OF THE INDO-EUROPEAN E/O ABLAUT

The Indo-European (IE) e/o ablaut can be described by a reciprocating rule of the
type discussed in Section XIV-D. The traditional presentation of the e/o ablaut is in
terms of the cases, tenses or other derivations of the ablauting stems. An extremely
telling observation is that words in compounds often have the opposite grade from the
words alone.1 This clearly indicates that the number of constituents in which the
ablauting stem is contained is critical. In generative grammar, the combination of a
morphophonemic cycle and an exchange rule is sensitive to the odd or even quality of
the number of constituents. The IE rule is of the form

[αgrave] → [~αgrave]

and it applies in a cycle. If the number of constituents containing the ablauting vowel
is odd, the grade is changed; if it is even, the grade is unchanged. Sections XIV-F
and XIV-G present the operation of this rule in Germanic and Greek. In the Germanic
languages the scope of the ablaut is sharply restricted, but the similarity of the essential
rule to that of Greek indicates that IE itself had a morphophonemic transformational
cycle that generated the e/o ablaut alternations.


T.  G.  Bever,  D.  T.  Langendoen 
F. THE E/O ABLAUT IN OLD ENGLISH

The Germanic reflex of the Indo-European e/o ablaut appears in the nonreduplicating
strong stems. The Germanic strong verbs, by Grimm's definition, show stem-vowel
changes in derived forms. This analysis follows Keyser's suggestion that the historical
distinction between the strong and weak stems is operative in Old English: strong stems
are phonemically monosyllabic; weak stems, polysyllabic.1

The Germanic nonreduplicating verb ablaut occurs in 6 classes that are traditionally
presented as 6 "ablaut series," each containing four forms. Since each of the
classes is phonemically distinct, the entire set of series can be generated by 5 basic
rules. Each rule is concerned with a single change in quality or quantity. This set
occurs in a morphophonemic transformational cycle.

The traditionally presented "principal parts" of the old Germanic strong verb are:
present infinitive, preterite singular, preterite plural, and past participle.


1.  Past  Participle 

The strong verb past participle throughout old Germanic is generated by the rules

[+voc, -cons] → Ø in the env ___ [+voc, -cons]
[+voc, -cons] → [+grave] in the env ___ [+sonorant]

Although these rules can be combined with the strong stem cycle, they will be omitted
to simplify this presentation.

The relevant constituent structure is described by the rules

Verb → stem (+ extension) + Mood

Mood → PerNo + {indicative, subjunctive, infinitive, participle}

PerNo → {sg, pl}

sg → + Ø
pl → + on        (both in the env past + ___)


In  Germanic,  the  ablauting  of  strong  stems  is  restricted  to  the  past  tense  or  the  deriva¬ 
tion  of  nouns.  This  is  represented  by  the  optional  stem  extension:  the  cyclic  rules  will 
apply  to  an  extended  strong  stem  only.  The  actual  morphophonemic  value  of  the  /+past/ 
stem  extension  is  not  critical,  although  it  is  probably  0.  This  interpretation  is  not  pro¬ 
posed  as  an  ultimate  solution:  it  merely  represents  the  fact  that  the  occurrence  of  the 
ablaut  is  limited  in  Germanic. 

The form of the stems to which the stem cycle applies is

(((# stem + Ext) + |PerNo|) + #)

and in the cases presented here

(((# stem + Ext) + |on|) + #),

where /+ext/ has the value /+past/. In OE the rule

C-1) [αgrave] → [~αgrave] in the env [Long Syllable]

is a reciprocating rule of the type discussed in Section XIV-E. Among the Germanic
languages this rule is restricted to OE because only in OE does the distinction between
/æ/ and /a/ "re-emerge" from reconstructed Proto-Germanic.

The environment "Long Syllable" is the same as that formulated by Keyser for the
weak verb cycle1:

[+cons] [+voc]^α [+son]^β [+cons],

where α + β ≥ 2.

The cycle reflects the historical verb class derivation,2,3 since it is fairly clearly
divided into two sections: the first 3 rules apply critically to the first 3 classes and
the final 2 rules apply critically to the last 3 classes. The fact that all the
environments are included between #C₀³ ___ C blocks the application of rule C-1) to the
vowel in /rid/ or /far/. It also blocks the application of rule C-4) to each vowel of
/raad/ or the vowel of /band/ because rule C-4) is actually

[+comp]_j → [+long]_j in the env #C₀³ ___ C + ext

The effect of the environment /+ext 0/ is to limit the application of rules C-4-b) and
C-5) to the first cycle, when nothing follows the /+ext/.

One incorrect form is generated by these rules: the preterite singular of /beeodon/.
It comes out /baaæd/ instead of the correct form /bææad/, but the diphthong structure
rules will correct this. In order not to violate the distinctness convention the diphthong
structure rules need to be included at the beginning of the cycle.


The Old English Weak Verb Cycle

C-1)  [αgrave] → [~αgrave]
C-2)  [ ] → [+diff]
C-3)  [ ]_j → Ø
C-4)  [+comp]_j → [+long]_j     (in two environments, a and b)
C-5)  [αcomp] → [~αcomp]

Each rule carries a pre-final restriction and a final restriction on its environment.
The final restriction of C-1) is "no restriction"; the remaining entries of the
restriction table are not legible in the source.

All rules apply in the env [+voc, -cons], in extended strong stems only; i.e., the
pre-final environment for each rule is actually contained between an initial #C and a
final C + ext (the exact environments given for rules C-1), C-2); for rule C-3); and
for rules C-4), C-5) are not legible in the source).

Low-level rules

p) Past → Ø

q) VV → V̄

r) Diphthong structure: [+comp]_j → [-grave]_j, [+grave]
   (may be ordered before C-1)


Morphophonemically, long vowels are considered sequences of short vowels.

Class   Infinitive   Preterite Sg.   Preterite Pl.
1       rīdan        rād             ridon
2       bēodan       bēad            budon
3       bindan       band            bundon
4       beran        bær             bǣron
5       metan        mæt             mǣton
6       faran        fōr             fōron

                        -grave    +grave
+diffuse                i         u
-diffuse, -compact      e         o
+compact                æ         a

OE Examples

Preterite Sg., Class 1
(((# riid + ext) + Ø) + #)
(# ruud + ext)                     C-1
(# raad + ext)                     C-5
(# raad + ext + Ø)                 New Cycle
(# rææd + ext + Ø)                 C-1
(# rææd + ext + Ø + #)             New Cycle
(# raad + ext + Ø + #)             C-1
# rād #                            Low-Level Rules

Preterite Pl., Class 2
(((# beeod + ext) + on) + #)
(# booed + ext)                    C-1
(# baaæd + ext)                    C-5
(# baaæd + ext + on)               New Cycle
(# bææad + ext + on)               C-1
(# biiud + ext + on)               C-2
(# bud + ext + on + #)             New Cycle
# budon #                          Low-Level Rules

Preterite Pl., Class 3
(((# bind + ext) + on) + #)
(# bund + ext)                     C-1
(# band + ext)                     C-5
(# band + ext + on)                New Cycle
(# bænd + ext + on)                C-1
(# bind + ext + on)                C-2
(# bind + ext + on + #)            New Cycle
(# bund + ext + on + #)            C-1
# bundon #                         Low-Level Rules

Preterite Pl., Class 4
(((# met + ext) + on) + #)
(# mæt + ext)                      C-5
(# mæt + ext + on)                 New Cycle
(# mǣt + ext + on)                 C-4
(# mǣt + ext + on + #)             New Cycle
# mǣton #                          Low-Level Rules

Preterite Sg., Class 5
(((# far + ext) + Ø) + #)
(# fār + ext)                      C-4
(# fōr + ext)                      C-5
(# fōr + ext + Ø)                  New Cycle
(# fōr + ext + Ø + #)              New Cycle
# fōr #                            Low-Level Rules

T.  G.  Bever 
G. THE E/O ABLAUT IN GREEK


The well-known alternations between the vowels e and o in verbal roots in Greek
can be predicted from the constituent structure of the words in which these roots appear
and a rule of the form

A.  [αgrave] → [~αgrave] in env [___, -diffuse, -compact, -cons] X + Y (+#),
    where X, Y do not contain D

operating in a transformational cycle.

To show how this rule operates, we require a statement of the internal constituent
structure of Greek verbs and of nominals derived from verbs. Despite all of the
attention that linguists have paid to these forms in the past, no such formulation has ever
been attempted for Greek. Consequently, the formulation that I present below
must be regarded as highly provisional. To substantiate the claims that
I make, or to refute them, we require a thorough statement of Greek syntax so that we
can see how the rules embodying these claims fit into such a syntax.


1. Constituent Structure of the Verb


Ignoring for the moment the low-level selectional restrictions, the phrase structure
of the Greek verb may be schematized by means of the following phrase structure rules:

P1.  V → (Augment) V_base Person Number (Voice)

P2.  V_base → V_stem Theme (Mood)

P3.  V_stem → V_root (Tense)

P4.  Voice → Middle

P5.  Mood → {Imperative, Optative, Subjunctive}

P6.  Tense → {Present, Future, Aorist, Perfect, Passive (Future), Aorist-Passive}

When  the  voice  constituent  is  missing,  the  verb  is  said  to  be  in  the  active  voice;  without 
the  mood  constituent  it  is  in  the  indicative  mood,  and  without  the  tense  constituent,  it  is 
in  the  second  aorist  tense  and  the  augment  constituent  must  be  present.  Only  certain 
verbs  can  appear  in  the  second  aorist.  In  a  complete  Greek  grammar,  we  shall  prob¬ 
ably  want  to  introduce  the  constituents  person,  number,  voice,  and  mood  transforma¬ 
tionally,  but  the  statement  given  here  is  adequate  for  our  purposes.  We  also  ignore 
the  problems  of  how  to  handle  the  "primary"  and  "secondary"  person  endings  of  the 
active  voice,  and  the  person  endings  in  the  perfect  tenses. 

The  theme  constituent  is  missing  in  certain  tenses  of  certain  verbs:  for  example, 
in  the  present  tense  of  verbs  whose  present  tense  constituent  is  nu.  This  constituent 
is  also  missing  in  the  perfect  middle  of  all  verbs;  its  absence  here  can  be  handled  by 
the  following  deletion  transformation: 


T1. Perfect Theme X Middle

    1 2 3 4 → 1 ∅ 3 4


The  perfect  constituent  then  permutes  with  the  verbal  root: 

T2. V_root Perfect

    1 2 → 2 1


An  additional  constituent  is  then  inserted  after  the  root  in  the  perfect  active: 

T3. Perfect V_root Theme

    1 2 3 → 1 2+K 3
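The effect of T1 through T3 on a string of constituents can be sketched as follows. This is only an illustrative sketch: the list representation, the constituent labels as Python strings, and the function names are assumptions introduced here, and the rules are applied once each in the order T1, T2, T3, as the text describes.

```python
def apply_t1(seq):
    """T1: Perfect Theme X Middle -> delete Theme (hypothetical list encoding)."""
    for i in range(len(seq) - 1):
        if seq[i] == "Perfect" and seq[i + 1] == "Theme" and "Middle" in seq[i + 2:]:
            return seq[:i + 1] + seq[i + 2:]
    return seq


def apply_t2(seq):
    """T2: V_root Perfect -> Perfect V_root (permutation)."""
    for i in range(len(seq) - 1):
        if seq[i] == "V_root" and seq[i + 1] == "Perfect":
            return seq[:i] + ["Perfect", "V_root"] + seq[i + 2:]
    return seq


def apply_t3(seq):
    """T3: Perfect V_root Theme -> Perfect V_root+K Theme (insertion of K)."""
    for i in range(len(seq) - 2):
        if seq[i:i + 3] == ["Perfect", "V_root", "Theme"]:
            return seq[:i + 2] + ["K"] + seq[i + 2:]
    return seq


def derive(seq):
    """Apply T1, T2, T3 once each, in that order."""
    for rule in (apply_t1, apply_t2, apply_t3):
        seq = rule(seq)
    return seq


perfect_active = ["V_root", "Perfect", "Theme", "Person", "Number"]
perfect_middle = ["V_root", "Perfect", "Theme", "Person", "Number", "Middle"]
print(derive(perfect_active))
print(derive(perfect_middle))
```

In this encoding the perfect middle loses its Theme by T1 and so never meets the structural description of T3, which is why K appears only in the perfect active.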


The constituents K and Perfect are rewritten in the morphophonemics by the rules:

M1. K → k in env [-cons]]_V_root + ___
    K → ∅ elsewhere

M2. Perfect →
    a) e in env ___ + [illegible]_2
    b) Ce in env ___ + C
    c) VC in env ___ + [illegible]
    d) V in env ___ + V

Note  that  rule  M2-c  gives  the  "Attic  reduplication." 

Given  this  apparatus,  we  are  able  to  predict  the  vocalism  of  verbal  roots  in  e:  we 
expect  e-vocalism  throughout  the  conjugation  except  in  the  perfect  active  tenses  of  verb 
roots  that  end  in  one  or  more  consonants.  Following  the  nomenclature  of  traditional 
Germanic  grammars,  we  shall  call  such  roots  strong;  roots  that  end  in  vowels,  we  shall 
call  weak.  Thus  consider  the  conjugation  of  the  verb  from  the  strong  verbal  root  streph, 
'twist, turn' (we conjugate in the first person plural throughout):


1. Present active (#(((+streph+∅)+e)+men)+#): stréphomen

2. Future active (#(((+streph+s)+e)+men)+#): strépsomen

3. Perfect active (#(((+e+streph)+e)+a+men)+#): estrophamen

4. Pluperfect active (#(+e((+e+streph)+e)+e+men)+#): estróphemen

5. Perfect middle (#((+e+streph)+metha)+#): estrámmetha

The a-vocalism in the perfect middle is a consequence of the zero-grade or vowel deletion
rule, which I have not formulated here. Where we find full-grade in perfect middles
of strong roots in e, we find the e-vocalism; thus for the root leip, 'leave', we have

6. Perfect middle (#((+le+leip)+metha)+#): leleimmetha

The conjugation of the weak verbal root kheu, 'pour', is exactly parallel to that of
streph, except that in the perfect active tenses, the morpheme k given by rule M1 is
present and the e-vocalism is maintained:

7.  Perfect  active  (#(((+khe+kheu+k)+e)+a+men)+#):  kekheukamen. 

Since  the  only  form  in  which  rule  A  applies  an  odd  number  of  times  to  the  vowel  of 
the  verbal  root  is  in  the  perfect  active  tenses  of  strong  verbs,  only  there  do  we  find  the 
o-vocalism in the root. In every other form it operates an even number of times: twice
in  the  perfect  middle  and  four  times  elsewhere. 
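The parity argument can be made concrete with a small sketch. It assumes, as the argument above suggests, that each application of rule A interchanges e and o; rule A itself is stated elsewhere in this report and is not reproduced here, and the function name and the particular odd count used for the perfect active are illustrative assumptions.

```python
def root_vowel(times_rule_a_applies, base="e"):
    """Surface vowel after rule A has applied the given number of times.

    Assumption: each application of rule A interchanges e and o.
    """
    toggle = {"e": "o", "o": "e"}
    vowel = base
    for _ in range(times_rule_a_applies):
        vowel = toggle[vowel]
    return vowel


# Counts for strong roots in e: "twice" and "four times" are quoted in the text;
# the text says only "an odd number" for the perfect active, so 3 is illustrative.
applications = {"perfect active": 3, "perfect middle": 2, "elsewhere": 4}
for form, count in applications.items():
    print(form, root_vowel(count))
```

Only the odd count leaves the root with o; every even count returns it to e.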

2.  Constituent  Structure  of  Nominals  Derived  from  Verbal  Roots 

It  seems  reasonable  to  suppose  that  the  derivation  of  nominals  from  verbs  in  Greek 
is merely a special case of generalized transformations that embed into noun or adjective
phrases deformations of full sentences in which the verb appears. The verbal root,
together  with  a  derivational  affix,  becomes  the  head  noun  or  adjective  in  the  phrase. 

The resulting noun must obtain a grammatical gender, and presumably it gets it from the
derivational affix. I claim, for the moment without justification, that neuter and common
gender affixes behave differently from affixes that are inherently feminine or masculine
(by common gender affix, I mean an affix that is not inherently either masculine
or feminine, but, depending on the derived word, takes on one or another of these
genders). The derived constituent structure of nouns having the first type of derivational
affix is

(#(+V_root+Affix+I)+#)


where  I  is  the  inflectional  suffix,  supplied  transformationally,  which  marks  the  case 
and number of the noun. Nouns formed with the second kind of derivational affix have
the structure

(#((+V_root+Affix)+I)+#)


The neuter derivational affixes are es and mat; the common gender affixes are the
agentive affixes ter and tor; inherently feminine affixes are a, id, and ad; inherently
masculine affixes are o and eu. Given this derived structure, we are immediately able
to account for the vocalism in derived nouns from verbal roots in e: derived neuter and
agent nouns will have e-vocalism, whereas derived feminine and masculine nouns formed
from the affixes just listed will have o-vocalism, as illustrated by the following examples,
in which we use the roots streph and blep, 'look':

Genitive singular:

8. (#(+streph+mat+os)+#) : strémmatos, 'a twist'

9. (#(+blep+es+os)+#) : blepeos, 'a look'

10. (#(+streph+ter+os)+#) : strepteros, 'anything which turns'

Nominative singular:

11. (#((+streph+o)+s)+#) : strophos, 'a band'

12. (#((+streph+eu)+s)+#) : stropheus, 'a vertebra'

13. (#((+streph+a)+∅)+#) : stropha, 'a turning around'

14. (#((+streph+id)+s)+#) : strophis, 'a girdle'

15. (#((+streph+ad)+s)+#) : strophas, 'a circling'

Weak verb roots that end in diffuse vowels also form derived nouns with the same
ablaut relationship. For example, from the root kheu

16. Genitive singular (#(+kheu+mat+os)+#) : kheumatos, 'a stream'

17. Nominative singular (#((+kheu+o)+s)+#) : khoos, 'a liquid measure'

18. Nominative singular (#((+kheu+a)+∅)+#) : khoa, 'a libation'

and  from  the  root  dei,  'fear' 

19.  Genitive  singular  (#(+dei+es+os)+#)  :  deeos,  'fear' 

3.  Extent  of  Ablaut  in  Greek 

Verbal  roots  with  fundamental  vocalism  o  rather  than  e  do  not  undergo  ablaut.  Thus, 
although  the  verbal  root  kop  'strike'  is  strong,  the  vocalism  of  the  perfect  is  not  different 
from  that  of  the  other  tenses;  the  present  active  is  koptomen,  and  the  perfect  active  is 
kekophamen. Similarly, all nouns derived from this root have an o-vocalism: for
example, kómmatos, genitive singular, 'that which is struck'; kopos, nominative singular,
'a striking'; and kopis, nominative singular, 'a chopper'. Furthermore, pure
noun roots never exhibit ablaut. Certain derivational and inflectional endings, however,
do show it, and in those cases in which it appears, it can be handled by rule A as before.
For example, the neuter affix es appears as os in the nominative and accusative singular.
But since there is a rule in Greek which deletes the nominative and accusative
singular marker in all neuter nominals not formed with the affix o, this alternation
follows immediately from rule A. Consider the nominative and accusative singular of
forms  9  and  19: 

20. (#(+blep+es)+#) : blépos

21. (#(+dei+es)+#) : déos

The  masculine  affix  o  of  examples  11  and  17  appears  as  e  in  the  vocative  singular 
because  the  vocative  singular  marker  has  been  deleted,  so  that  rule  A  applies  one  less 
time  to  it.  The  affix  eu,  however,  does  not  change  in  the  vocative  singular. 

The  theme  of  the  verbal  conjugation,  which  we  have  written  e,  also  undergoes  ablaut, 
but  rule  A  alone  cannot  give  the  correct  results.  If,  however,  we  suppose  that  there 
is  also  a  low-level  phonetic  rule  in  Greek, 

e → o in env ___ + [+nasal],

then  the  alternations  of  the  theme  vowel  can  be  handled. 

It is convenient, then, to set up a class of ablauting vowels in Greek: the e of verbal
roots,  and  the  vowels  of  certain  inflectional  morphemes.  We  are  not  forced,  however, 
to  mark  the  distinction  between  ablauting  and  nonablauting  vowels  phonemically;  we 
require  only  a  morphophonemic  rule  that  will  specially  mark  the  ablauting  vowels,  and 
an  adjustment  to  rule  A  which  will  allow  it  to  operate  only  on  those  vowels  that  are  so 
marked. 

I  am  at  present  working  on  a  system  of  rules  which  will  account  for  the  loss  of  vowels 
in certain positions (the zero-grade phenomenon), vowel lengthening, and accent placement.
This entire system is also, apparently, part of the transformational cycle in
Greek.

D. T. Langendoen
A. DESIGN PHILOSOPHY FOR PSYCHOPHYSICAL EXPERIMENTS RELATED TO
A THEORY OF AUDITORY FUNCTION

Psychophysics should play a major role in the evolution of a theory of auditory function,
together with anatomy and physiology. We feel that the psychophysical tests that
will be most valuable for this purpose are those that can be related to the physiological
information now available about hearing. In terms of this objective of relating
psychophysical and physiological data, it is possible to argue for a "best" psychophysical
test procedure.

A good deal is known about the mechanical properties of the ear,1 and about the
neural activity at the level of the acoustic nerve,2 that is, about what might be called
the  peripheral  auditory  system.  In  particular,  it  is  now  clear  that  some  information 
about  the  external  acoustical  stimulus  is  destroyed  by  the  peripheral  auditory  system, 
and  that  psychophysical  test  performance  is  thereby  fundamentally  limited.  By  utilizing 
the  available  physiological  information  about  the  periphery  in  a  statistical  hypothesis 
testing  formulation,  the  limits  on  performance  imposed  by  the  periphery  in  certain 
psychophysical  tests  could  be  described  quantitatively,  except,  perhaps,  for  a  few 
unknown  parameters  and  functional  forms.  The  more  central  system  also  imposes 
limits on performance, but these more central limitations cannot be described quantitatively
on the basis of physiological information, as present understanding of what is
done  centrally  with  the  information  available  from  the  periphery  is  relatively  incomplete. 
It  may  be  that  in  some  psychophysical  tests,  the  dominant  limitations  on  performance 
are peripheral; that is, the central processing is effectively as good as the mathematically
optimal processing. If such is the case, the quantitative description of peripheral
limitations for these tests should correctly describe the characteristics of the
psychophysical data. Since some psychophysical data must be used to complete the formulation
of  peripheral  bounds,  the  only  fact  that  can  really  be  demonstrated  experimentally  is 
that  there  exists  a  class  of  psychophysical  tests  in  which  the  data  are  as  if  peripheral 
bounds  were  dominant.  We  can  look  for  a  class  of  psychophysical  tests  in  which  the 
data  from  one  test  are  related  to  the  data  from  another  in  the  same  way  as  they  would 
be  if  peripheral  limitations  were  dominant,  but,  even  if  we  are  successful,  we  cannot 
claim  to  have  demonstrated  that  peripheral  limitations  are  really  dominant.  However, 
this  approach  seems  to  be  a  legitimate  and  promising  way  in  which  to  try  to  relate 
physiological  and  psychophysical  phenomena. 

If  we  subscribe  to  this  approach,  we  must  look  first  for  psychophysical  experiments 
that  can  be  related  to  the  statistical  hypothesis  testing  formulation  of  peripheral  bounds 
and  hold  some  promise  of  being  tests  in  which  peripheral  factors  are  dominant.  Given 
these conditions, we can argue for several desirable attributes of psychophysical tests:

(i)  The  tests  should  involve  discriminations  rather  than  absolute  recognitions. 

(ii) The subject should be thoroughly informed about every aspect of the experimental
design except the actual sequence of presentations to be used (the subjects used
in  the  experiments  described  here  were  graduate  students  in  electrical  engineering,  and 
hence  had  no  difficulty  in  understanding  a  quantitative  description  of  the  stimuli). 

(iii)  The  subject  should  be  asked  to  choose  from  a  finite  and  preferably  small  set 
of  responses,  say,  two. 

(iv) The stimulus parameters should unambiguously define which response is correct,
and the subject should be informed of which response was correct after each trial.

(v)  The  subject  should  be  asked  to  use  all  of  the  information  available  to  him  to  do 
as  well  as  possible  in  terms  of  some  scoring  system. 

(vi)  There  should  be  a  minimum  of  different  possible  stimuli,  say,  two. 

These points lead to consideration of two general types of discrimination tests, each
of which involves only two possible stimuli and two possible responses. In one configuration,
the subject is presented stimulus A followed by stimulus A (AA), or stimulus
A followed by stimulus B (AB), and asked whether the two stimuli are the same or different.
In the other configuration the subject is presented either AB or BA and asked
in which position B occurred.

A  series  of  experiments  in  frequency  discrimination  was  run  to  evaluate  the  relative 
merits of variations of these two presentations. Stability of performance was a major
consideration  in  evaluating  the  various  psychophysical  test  schemes,  since  unstable 
behavior  was  regarded  as  a  clear  indication  that  peripheral  bounds  were  not  dominant. 

We found that the symmetric presentations (those in which either pair occurred with
a priori probability 1/2) were better in terms of stability than the asymmetric. This
conclusion was supported by the subjective feelings of the observers and the quantitative
aspects of the experimental data. We also found that the High-Low (AB, BA) presentation
was better in terms of stability than the Same-Different (AA, AB). The subjects
preferred the (AB, BA) configuration because they felt that high-low was exactly the
question they first answered in listening to the stimuli, whereas in the (AA, AB)
configuration the stimuli all sounded different and they asked themselves "different enough?"
The subjects' feelings were reflected quantitatively in the data, with the (AB, BA) data
being significantly more stable than the (AA, AB). We feel that the two-alternative
forced-choice experiment (AB, BA) would also be "best," given the goal of relating
physiological  to  psychophysical  data,  in  tests  of  discrimination  other  than  frequency 
and  in  detection  tests. 
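As an illustration of how a symmetric (AB, BA) test fits a statistical hypothesis testing formulation, the following sketch simulates an observer who receives one noisy observation per interval and answers that B occupied the interval with the larger observation. The Gaussian internal noise, the decision rule, and all names and parameter values are assumptions made for this sketch; they are not taken from the experiments or the thesis cited here.

```python
import math
import random


def simulate_2afc(delta, sigma, n_trials, seed=0):
    """Fraction of correct responses in a symmetric (AB, BA) test.

    delta: assumed mean separation between the internal responses to B and A.
    sigma: assumed standard deviation of the internal Gaussian noise.
    """
    rng = random.Random(seed)
    correct = 0
    for _ in range(n_trials):
        b_first = rng.random() < 0.5           # BA or AB, a priori probability 1/2
        first = rng.gauss(delta if b_first else 0.0, sigma)
        second = rng.gauss(0.0 if b_first else delta, sigma)
        says_b_first = first > second          # pick the interval with the larger value
        if says_b_first == b_first:
            correct += 1
    return correct / n_trials


def predicted_2afc(delta, sigma):
    """Probability correct for this observer: Phi(delta / (sigma * sqrt(2)))."""
    return 0.5 * (1.0 + math.erf(delta / (2.0 * sigma)))


print(simulate_2afc(1.0, 1.0, 20000, seed=1), predicted_2afc(1.0, 1.0))
```

Under these assumptions the simulated fraction correct should fall close to the closed-form value, which depends only on the ratio of delta to sigma.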

Further  details  of  these  arguments,  the  statistical  hypothesis  testing  formulation, 
and  the  experiments  referred  to  in  this  report  may  be  found  in  the  author's  thesis.3 

P.  R.  Gray 
B. EVOKED RESPONSES IN RELATION TO VISUAL PERCEPTION AND OCULOMOTOR
REACTION TIMES IN MAN

[This  report  is  a  summary  of  an  invited  paper  read  at  the  Conference,  "Sensory 
Evoked  Response  in  Man,"  sponsored  by  the  New  York  Academy  of  Sciences  and  Albert 
Einstein College of Medicine, New York, February 14-15, 1963. The paper will be published
as a part of a Conference Monograph in the Annals of the New York Academy
of Sciences.1]

Responses evoked at the occiput in man by visual stimuli have been studied in relation
to oculomotor reaction times, and in relation to some of the associated phenomena
of  visual  perception.  Sudden  changes  in  the  vertical  position  of  a  spot  on  an  oscilloscope 
screen  were  used  as  stimuli  and  as  tracking  signals,  and  motor  responses  consisted 
of  the  corresponding  redirections  of  gaze  to  the  successively  new  positions  of  the  spot. 
For subsequent analysis, simultaneous recordings were made onto magnetic tape and
paper  of:  (a)  spot  position,  (b)  eye  position  (monitored  electro-oculographically),  and 
(c)  EEG  potentials  recorded  bipolarly  from  a  pair  of  midline  parieto-occipital  scalp 
electrodes. 

Averages  of  parieto-occipital  evoked  potentials  for  a  large  number  of  stimuli  were 
obtained  from  recordings  made  during  visual  tracking  of  the  spot,  and  also  from 
recordings  carried  out  while  gaze  was  maintained  fixed  at  the  center  of  the  oscilloscope 
screen. Also, averaged responses, with eyes closed, to flashes of light from a stroboscope
were obtained. Oculomotor reaction times were determined by averaging of
electro-oculographic potentials, and by crosscorrelation of the latter with the
spot-position signals.

In  separate  psychophysical  experiments  with  some  subjects,  an  attempt  was  made 
to determine, by indirect means, the approximate time interval for subjective perception
of the stimuli, and for subjective perception of eye movements, for comparison
with  the  latencies  of  various  components  of  the  parieto-occipital  evoked  responses. 

Potentials  evoked  by  the  shifting  spot  of  light  were  generally  appreciably  smaller, 
and  greater  in  latency,  than  those  evoked  by  bright  flashes  of  light  with  the  eyes  closed, 
but  the  amplitude  of  certain  components  of  the  former  (that  is,  those  with  an  output  at 
approximately  80-90  msec)  was  clearly  increased  during  tracking  for  some  subjects, 
an effect that may have been due to the fact that the fovea of the retina was more frequently
stimulated during tracking than it was when the eyes were maintained fixed.
Consideration  of  the  latencies  of  these  components  suggests  that  they  may  represent 
nonspecific  rather  than  specific  evoked  responses  in  the  visual  system.  Comparison 
of averages of EEG responses with those of EOG responses established that the
above-mentioned components appearing during tracking did not represent the electrical field,
at the parieto-occipital scalp electrodes, of the EOG potentials, the amplitude of the
latter being much larger than that of the EEG potentials. Suggestive, but not conclusive,
evidence for different forms of evoked responses for different directions of
spot-shifts was obtained for some subjects.

A  comparison,  for  one  subject,  of  the  results  of  the  psychophysical  experiments 
with  those  of  the  electrophysiological  recordings  provided  some  tentative  evidence  that 
the  subjective  perception  of  spot-shifts  takes  place,  on  the  average,  rather  early 
(that is, within 50 msec), a finding that suggests that visual perception of simple
stimuli  may  be  more  closely  related  to  specific  visual  pathways  than  to  nonspecific 
ones.  Only  minimal  evidence  of  an  early,  specific  response  was,  however,  obtained 
in  the  present  series  of  experiments  for  spot-shifts  as  visual  stimuli.  It  appears 
probable  that  a  component  appearing  in  the  averaged  parieto-occipital  EEG  response 
at  a  constant  interval  of  time  after  the  mean  oculomotor  reaction  time  represents 
the  same  basic  phenomenon  as  that  of  the  "lambda  waves"  following  eye  movements 
that have been described previously.

The  findings  from  these  electrophysiological  and  psychophysical  experiments  are 
discussed  in  relation  to  the  question  of  quantization  of  time  in  the  nervous  system. 

This  work  was  supported  in  part  at  the  Massachusetts  General  Hospital  by  a  U.  S. 
Public  Health  Service  career  program  award  (Number  5-K3-NB-9201),  and  by  a  Public 
Health  Service  research  grant  (Number  B-3752),  from  the  National  Institute  of  Neuro¬ 
logical  Diseases  and  Blindness. 

J.  S.  Barlow 

C. A MODEL FOR FIRING PATTERNS OF AUDITORY NERVE FIBERS

Acoustic  signals  are  represented  by  sequences  of  spike  potentials  in  the  auditory 
(VIIIth)  nerve.  The  form  of  this  code  and  the  encoding  mechanism  have  been  subjects 
of  considerable  interest  in  physiology.  Recent  experimental  results  of  Kiang  and  others1 
have  revealed  the  highly  systematic  structure  of  the  spike  activity  of  fibers  in  the  VIIIth 
nerve of cats. These results have led to the work, summarized here, on the construction
and testing of a model of the peripheral auditory system that relates the firing of
an VIIIth-nerve fiber to sound coming into the ear.

Sound  enters  the  outer  ear,  impinges  on  the  eardrum,  and  is  transmitted  through 
the  middle-ear  structures  to  the  fluids  of  the  cochlea  or  inner  ear.  The  fluid  motion 
results in motion of the cochlear partition, which in turn results in forces on and/or
movements of the auditory receptor cells or hair cells. The hair cells are thought to
be transducer elements whose function is to produce local excitation of nearby
VIIIth-nerve fibers. The spike potentials resulting from this excitation are transmitted to the
brain  via  the  VIIIth  nerve. 

Figure XV-1 shows a model of this system. The "Mechanical System" represents
the functional relationship between an acoustic pressure input to the ear and a displacement
of the cochlear partition and is assumed to be representable as a linear system
over a range of approximately 80 db of sound intensity. The transfer function of this
part of the system for a particular point along the cochlear partition is assumed to be
given by the work of von Békésy2 for frequencies below approximately 2 kc. The
"Transducer" is intended to represent the functional relationship between the displacement
of the cochlear partition at a point along its length and the output of a hair cell at
that point. The final block shows a stereotyped or idealized "Model Neuron." In this
model the output of the transducer is summed with noise at the input of the neuron. This
noise is included to account for both the spontaneous activity and the probabilistic
response behavior characteristic of VIIIth-nerve fibers. The existence of this noise
(additive to the membrane potential) can be justified to some extent on the basis of
extrapolations from some empirical evidence that is due to Verveen.3 The sum of the
noise and the input to the neuron is filtered and then compared with a threshold in the
box labelled "C." If the threshold is exceeded, then a spike is defined as occurring,
and the threshold is reset to some larger value by the box labelled "R."

p(t), Pressure at the eardrum
y(t,x), Displacement of the cochlear partition
*(t,x), Output of hair cell
f(t,x), Action potentials of fiber
h(t,x), Impulse response of the cochlear partition at a point x cm from the stapes
G(y), A function relating the output of a hair cell to displacement of the cochlear
      partition at a point x cm from the stapes
g(t), Impulse response of a linear filter
r(t,x), Threshold of the neuron model at time t

Fig. XV-1. Model relating the firing patterns of fibers
in the auditory nerve to acoustic stimuli.


Fig.  XV-2.  Threshold  and  membrane  potential  of  the  model  neuron  as  a  function  of  time. 
Figure XV-2 shows both the noisy membrane potential of the model neuron and the
threshold as a function of time. The threshold is reset to some large value (tMax) upon
the occurrence of a spike and decays to its resting value (tRest) with a decrement (tInc).
In the lower right-hand corner there is a representation of the membrane and threshold
potentials generated by the TX-2 computer simulation of this system. The top trace
shows the noisy membrane potential. The center trace shows the threshold as a function
of  time.  The  bottom  trace  of  dots  shows  the  times  of  occurrence  of  spikes  and  a  set  of 
time  markers. 
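The threshold mechanism just described can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the TX-2 program: the linear filter g(t) is taken to be the identity, the noise is drawn independently at each sample from a Gaussian distribution, the threshold is made to decay exponentially toward its resting value, and the function name and all parameter values are hypothetical.

```python
import math
import random


def simulate_neuron(stimulus, dt, t_rest, t_max, tau, noise_sd, seed=0):
    """Return spike times (in seconds) for a threshold-crossing model neuron.

    stimulus: sequence of transducer outputs, one per time step of length dt.
    t_rest, t_max: resting and reset values of the threshold.
    tau: assumed time constant of the exponential threshold decay.
    """
    rng = random.Random(seed)
    decay = math.exp(-dt / tau)
    threshold = t_rest
    spikes = []
    for i, drive in enumerate(stimulus):
        potential = drive + rng.gauss(0.0, noise_sd)   # transducer output plus noise
        if potential > threshold:                      # comparison (box "C")
            spikes.append(i * dt)
            threshold = t_max                          # reset (box "R")
        else:
            threshold = t_rest + (threshold - t_rest) * decay
    return spikes


# Spontaneous activity: no stimulus, 2 seconds sampled at 100-microsecond steps.
spikes = simulate_neuron([0.0] * 20000, dt=1e-4, t_rest=1.0, t_max=10.0,
                         tau=1e-3 / 3, noise_sd=0.5, seed=1)
print(len(spikes))
```

Because the threshold sits far above the noise immediately after a spike, very short intervals between spikes cannot occur in this sketch, which is the refractory behaviour the model is meant to capture.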

The  fundamental  assumptions  in  this  model  are: 

(i)  The  mechanical  system  is  assumed  to  be  representable  by  a  linear  system. 

(ii)  A  point-to-point  relation  between  the  displacement  of  the  cochlear  partition 
and  the  neural  excitation  is  assumed.  A  particular  neural  fiber  is  assumed  to  be 
excited  by  a  neighboring  hair  cell,  which  in  turn  responds  to  the  displacement  of  the 
cochlear  partition  at  a  point  along  its  length. 

(iii) The process of neural excitation is represented by a simple model neuron.
This model is probabilistic and characterized by threshold and refractory effects.

(iv)  The  effects  of  efferent  fibers  on  the  peripheral  system  are  ignored. 

This model has been simulated on the TX-2 computer at Lincoln Laboratory, M.I.T.,
and statistics of the spontaneous activity and response of the model to a variety of acoustic
stimuli have been studied and compared with the VIIIth-nerve data obtained by Kiang
and others.1

Fig. XV-3. Histogram of intervals between spikes generated
by the model in the absence of a stimulus. The
time constant of the decay of the threshold to its
resting value is 1/3 msec. The noise spectrum
is flat from 10 cps to 5 kc. (Abscissa: interval in milliseconds.)

Fig. XV-4. Impulse response of the cochlear partition at a point whose characteristic
frequency is 500 cps. Dot separation is 100 μsec, and successive maxima of the
response are separated by 2 msec. This particular impulse response corresponds
to F3(t) (see Flanagan4).

Fig. XV-5. PST histogram of the click response of the model for a fiber
whose characteristic frequency is 500 cps. (Panel legend: click response of model
with nonlinearity; Δt = 100 μsec; CF = 500 cps; 140 bins; 3729 spikes; time
marker 2 msec.)

Figure XV-3 shows a histogram of interspike intervals of the spontaneous activity
generated by the model. The insert shows this histogram as photographed from the
TX-2 oscilloscopic output. The rest of the figure shows the interval histogram plotted
in semi-logarithmic coordinates. Notice that the tail of the distribution is linear with
these coordinates, or exponential on linear coordinates. For very short intervals, the
number of firings decreases to zero, a reflection of the refractory properties of
the model. This general form of the distribution of intervals in the spontaneous (or
unstimulated) case is characteristic of VIIIth-nerve fibers.
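An interval histogram of this kind is simple to compute from a list of spike times. The sketch below is illustrative only; the function name, the bin layout, and the example numbers are assumptions rather than details of the TX-2 program.

```python
def interval_histogram(spike_times, bin_width, n_bins):
    """Count intervals between successive spikes in bins of equal width."""
    counts = [0] * n_bins
    for earlier, later in zip(spike_times, spike_times[1:]):
        k = int((later - earlier) / bin_width)   # index of the bin holding this interval
        if 0 <= k < n_bins:
            counts[k] += 1                       # longer intervals fall outside and are dropped
    return counts


# Spike times in msec; the successive intervals are 1, 2, and 3 msec.
print(interval_histogram([0.0, 1.0, 3.0, 6.0], bin_width=1.0, n_bins=5))
```

Plotting the logarithm of each nonzero count against the bin index gives the semi-logarithmic display described above, in which an exponential tail appears as a straight line.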

Figure XV-4 shows the response of the cochlear partition to a short click determined
by Flanagan4 from the data of von Békésy. In this case, the response is given for a point
along the cochlear partition whose characteristic frequency is 500 cps. Figure XV-5
shows the response of a model fiber that is assumed to innervate a hair cell at the same
spot on the cochlear partition. The response is in the form of a PST (Post Stimulus
Time) histogram. This histogram shows the number of firings that occur at a time t
after the onset of a stimulus. The peaks in this histogram are separated by an interval
of 2 msec and correspond to the times of the maximum positive deflections in the
response of the cochlear partition at the 500-cps point to the click. This general form
of the model PST histogram agrees quite well with the form of similar histograms
computed from the VIIIth-nerve data.
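A PST histogram can be accumulated as in the following sketch, which pools the spikes from repeated presentations of the same stimulus. The function name, the data layout (one list of spike times per presentation, measured from stimulus onset), and the example values are assumptions made for illustration.

```python
def pst_histogram(trials, bin_width, n_bins):
    """Count firings at each time after stimulus onset, summed over presentations."""
    counts = [0] * n_bins
    for spike_times in trials:
        for t in spike_times:
            if t < 0:
                continue                  # ignore anything before stimulus onset
            k = int(t / bin_width)
            if k < n_bins:
                counts[k] += 1
    return counts


# Two presentations, times in msec; firings cluster near 0 and 2 msec.
trials = [[0.5, 2.5], [0.25, 2.75, 9.0]]
print(pst_histogram(trials, bin_width=1.0, n_bins=4))
```

With a 500-cps characteristic frequency, peaks in such a histogram would be expected at multiples of 2 msec, matching the spacing of the positive deflections of the cochlear partition.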

Figure XV-6 shows PST histograms of the click response of the model for two different
forms of the transducer as a function of sound intensity. The histograms on the
left side show the results for a linear transducer, that is, the output of the transducer
is proportional to its input. Note that there is a 36-db range between the intensity at
which a response first appears to be visible in the PST histogram and the intensity at
which the response of the model becomes stereotyped (that is, the model responds to
the largest positive deflection of the cochlear partition every time a stimulus is
presented). This behavior does not agree with the observed response of VIIIth-nerve fibers.
Since  the  envelope  of  the  impulse  response  of  the  cochlear  partition  decays  at  a  rate 
exceeding  20  db  per  cycle  of  oscillation,  there  can  be  at  most  two  or  three  peaks  in  the 
PST  histogram  when  a  linear  transducer  model  is  used.  The  linear  transducer  function 
thus  yields  PST  histograms  that  are  at  variance  with  the  empirical  evidence  of  Kiang 
and  others.  The  histograms  on  the  right  side  of  Fig.  XV-6  show  the  results  obtained 
for  a  transducer  model  that  is  a  nonlinear  saturation  function.  The  results  obtained 
with  the  nonlinear  transducer  model  agree  qualitatively  with  the  results  of  Kiang  and 
others. 
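The difference between the two transducer forms can be illustrated with a short sketch. The hyperbolic tangent used here is only a stand-in for a saturation function and is an assumption of this sketch; it is not the G(y) used in the simulation, and the function names and parameter values are likewise hypothetical. Successive peaks of the partition response are taken to fall by 20 db per cycle.

```python
import math


def linear_transducer(y):
    """Output proportional to displacement."""
    return y


def saturating_transducer(y, y_sat=1.0):
    """Assumed saturation function (tanh), standing in for a nonlinear G(y)."""
    return y_sat * math.tanh(y / y_sat)


def peak_outputs(transducer, click_gain, n_peaks=3, db_per_cycle=20.0):
    """Transducer output at successive positive peaks of a decaying response."""
    ratio = 10.0 ** (-db_per_cycle / 20.0)      # amplitude ratio between adjacent peaks
    return [transducer(click_gain * ratio ** k) for k in range(n_peaks)]


intense_click = 100.0                            # arbitrary units, far above saturation
print(peak_outputs(linear_transducer, intense_click))
print(peak_outputs(saturating_transducer, intense_click))
```

With the linear form the second peak always stays a factor of ten below the first, whereas with the saturating form an intense click drives several successive peaks to nearly the same output, so more peaks can reach the neuron's threshold.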

One of the puzzling results of the investigations of Kiang and others indicates that
the sensitivity of neurons responding to sinusoids at their characteristic frequency
increases as a function of frequency (at a rate of approximately 40 db/decade) in the
range 100 cps-2 kc.5 Such a change in sensitivity is seen at higher auditory centers
and even in the human audiogram,6 but it was surprising to encounter this effect so close
to the periphery, particularly since the mechanical part of the system does not exhibit
so large a change in sensitivity. Preliminary results tend to indicate that this effect
can be explained by the model. That is, the model predicts a change in sensitivity as
a function of frequency which is qualitatively the same as the change in sensitivity seen
in VIIIth-nerve fibers, but the quantitative results depend upon the definition of the
threshold of firing of a unit and a knowledge of the exact parameters of the system.

Fig. XV-6. PST histograms of the click response of the model for two
different transducer functions vs intensity of click. The left-column
histograms were computed for a linear transducer.
The right-column histograms were computed for a nonlinear
saturating transducer G(y). (Panel labels: scale factor and intensity in db
for each histogram; Δt = 100 μsec; CF = 500 cps.)

QPR No. 69

To  summarize,  this  model  of  the  peripheral  auditory  system  seems  capable  of  fitting 
certain available VIIIth-nerve data over a considerable range of stimuli. The important
constituents  of  the  model  are  a  linear  bandpass  filter,  followed  by  a  nonlinear  saturation 
function,  followed  by  a  probabilistic  threshold  device  with  refractory  properties. 
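
The three stages named in this summary can be sketched in a few lines. This is only an illustration under stated assumptions, not the model as it was actually programmed: the bandpass stage is stood in for by a damped sinusoid (500 cps, decaying 20 db per cycle, the figures quoted above), the saturation function is taken to be a hyperbolic tangent because the report's G(y) is not given legibly, and the firing rule, gain, and refractory period are hypothetical choices.

```python
import math
import random

CF_HZ = 500.0          # characteristic frequency quoted for Fig. XV-6
DT = 100e-6            # 100-microsecond time step, as quoted for Fig. XV-6
DECAY_PER_CYCLE = 0.1  # amplitude ratio per cycle for a 20 db/cycle decay


def click_response(amplitude, n_steps):
    """Linear stage: damped sinusoid used as a stand-in bandpass click response."""
    out = []
    for n in range(n_steps):
        t = n * DT
        envelope = amplitude * DECAY_PER_CYCLE ** (t * CF_HZ)
        out.append(envelope * math.sin(2.0 * math.pi * CF_HZ * t))
    return out


def saturate(y, limit=1.0):
    """Nonlinear stage: tanh saturation (an assumed form, not the report's G(y))."""
    return limit * math.tanh(y / limit)


def one_trial(drive, gain, refractory_steps, rng):
    """Probabilistic threshold device with a dead time after each firing.

    In each time step the unit fires with probability gain * max(drive, 0),
    capped at 1, and is then silent for refractory_steps steps.
    """
    spikes = []
    next_allowed = 0
    for n, d in enumerate(drive):
        if n < next_allowed:
            continue
        p_fire = min(1.0, gain * max(d, 0.0))
        if rng.random() < p_fire:
            spikes.append(n)
            next_allowed = n + refractory_steps
    return spikes


def pst_histogram(amplitude, n_trials=200, n_steps=100, gain=0.5,
                  refractory_steps=10, seed=0):
    """Count firings per time bin over repeated presentations of the click."""
    rng = random.Random(seed)
    drive = [saturate(y) for y in click_response(amplitude, n_steps)]
    counts = [0] * n_steps
    for _ in range(n_trials):
        for n in one_trial(drive, gain, refractory_steps, rng):
            counts[n] += 1
    return counts
```

Calling `pst_histogram` with increasing `amplitude` gives a rough picture of how the histogram changes with click intensity; replacing `saturate` by the identity function gives the linear-transducer case for comparison.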

T.  F.  Weiss 
D. CORTICAL FACILITATION FOLLOWING ONSET OR TERMINATION OF A LIGHT

The  present  report  concerns  the  shock-evoked  response  complex  (SERC)  recorded 
from  the  primary  visual  cortex.  This  response  has  been  the  subject  of  considerable 
study.  The  complex  of  deflections  is  obtained  whether  one  delivers  the  "test"  shocks 
to  the  optic  nerve,  chiasm,  tract,  the  lateral  geniculate  body  or  the  optic  radiation. 
(These deflections are usually numbered as illustrated in Fig. XV-8, trace D.) There
is general agreement that deflection no. 1 is related to the volley of impulses in nerve
fibers that feed into the cortex, and that deflections no. 3, no. 4, and no. 5 are measures
of  intracortical  events  resulting  from  this  influx.  The  SERC  for  a  given  size  of  test 
shock may be modified by changing the animal's physiological state, or by changing
conditions of retinal stimulation. The most striking modification of this SERC is an
enhancement, which has been demonstrated under a number of conditions, including
arousal by reticular stimulation or other means, presentation of a continuous diffuse
illumination,5 and retinal deafferentation.6 Besides these long-lasting effects, there
are transient changes in the SERC following a sensory or arousing stimulus. In this
report  we  shall  deal  with  such  a  transient  effect;  specifically,  with  a  strong  enhancement 
following  the  onset  or  termination  of  the  presentation  of  a  light.  The  effect  is  most 
clearly  demonstrated  when  the  shocks  are  delivered  to  the  optic  radiation  rather  than  to 
more  peripheral  loci  in  the  afferent  pathway,  and  it  is  abolished  by  deep  barbiturate 
anesthetization. The experiments were performed with unanesthetized cats with
midpontine pretrigeminal brain-stem sections.

Recording  was  by  a  monopolar  electrode  (silver  ball  or  cotton  wick  soaked  in  NaCl 
solution),  with  the  neutral  electrode  on  the  skin  of  the  neck.  Shocks  were  delivered  by 
concentric  electrodes.  The  light  source  was  a  Westinghouse  47  bulb  placed  in  the  end 
of  a  hollow  cylinder,  18  cm  long  and  3  cm  in  diameter.  At  the  other  end  of  the  cylinder 
was  a  glass  diffuser  that  was  placed  directly  in  front  of  the  cat's  eyes  at  a  distance  of 
approximately 8 cm. The bulb was excited by a 6-volt dc source. The electrocortical
activity was recorded on magnetic tape and processed by the ARC-1 average response
computer.

Fig. XV-7. Temporal pattern of stimuli. Top line shows the timing of the shocks to
the optic pathway. Stimuli are delivered repetitively at the rate of 1 per
2 seconds or 1 per 4 seconds. In control conditions the cat is in the dark.
When light or reticular stimulation is coupled with the shocks the time
sequence is as shown, with Δt the time between the onset of the electric
signal to the light, or the train of shocks to the reticular formation
(represented by a solid bar) and the shock to the optic pathway. (For the time
course of the light onset see Fig. XV-8.) Duration of the light stimulus is
1 sec; duration of the train of shocks to the reticular formation, 0.1 sec.

Figure  XV-7  illustrates  the  temporal  pattern  of  stimuli  used  in  the  experiments.  The 
average  waveform  of  cortical  responses  to  the  onset  of  the  light  is  shown  in  Fig.  XV-8, 
trace  A.  Figure  XV-8,  trace  B  shows,  on  the  same  time  and  amplitude  scale  as  in 
trace A, the average of responses to the light together with a single shock to the optic
radiation delivered at the time marked by the arrow. Figure XV-8, trace C shows the
time course of the onset of the light. It is clear that the SERC is larger and of much
shorter duration than the response to the light alone, so that large changes in the sizes
of its deflections cannot result from an addition of these two responses. Figure XV-8,
trace D shows a more detailed view of the SERC on a different time scale.

Fig. XV-8. Evoked responses from primary visual cortex. A: average of responses
to the onset of the light. B: average of responses to the onset of the light
together with a shock to the optic radiation homolateral to the recording
electrode delivered at the time indicated by the arrow. C: time course
of the light as measured by a photomultiplier tube. The trace starts at the
onset of the electric signal to the bulb. D: detail of the part of the
waveform in B following the shock. The shock artefact (s.a.) and characteristic
deflections of the shock-evoked response complex (SERC) are indicated
in this and subsequent figures. The shocks are 4 v in amplitude and 0.2
msec in duration, and are presented once every 4 seconds. In this and
subsequent figures the number of responses in each average is 32.
(Time marker: 5 msec.)

The  enhancement  effect  is  illustrated  in  Fig.  XV-9.  Figure  XV-9,  trace  A  shows 
the  SERC  (shocks  delivered  to  the  optic  radiations)  with  control  (dark)  conditions. 
Figure  XV-9,  trace  B  shows  the  SERC  when  the  shocks  are  delivered  150  msec  following 
the  electric  signal  to  the  light  source.  (Note  that  there  is  an  approximate  30-msec  delay 
between this event and the light onset; see Fig. XV-8, trace C.) It is interesting to
observe that only deflections no. 3, no. 4, and no. 5 are modified significantly. The
differential waveform of Fig. XV-9 illustrates this point. A similar enhancement effect
may be demonstrated following termination of the light.

Fig. XV-9. Enhancement of the SERC by the onset of a light. A: average of responses
to shocks to the optic radiation under control conditions of darkness.
B: average of responses to shocks together with the presentation of light.
The third trace is the difference between the waveforms obtained in B and
A. The shocks were given 150 msec after the electric signal to the light.
Other stimulus parameters are the same as in Fig. XV-8.

Fig. XV-10. Modification of the SERC by the onset of a light and by reticular
stimulation. Left column shows averages of responses to shocks to the optic
chiasm, and their modification by the onset of a light and by the onset of
a train of shocks to the reticular formation. Right column gives similar
results for shocks delivered to the optic radiation. The stimuli were
delivered repetitively at a rate of 1 per 2 seconds. The shocks to the
chiasm were 4 v in amplitude, 0.2 msec in duration; those to the optic
radiation were 3 v in amplitude, 0.2 msec in duration. Reticular stimuli
were 0.1-sec trains of shocks having 2-volt amplitude, 300/sec rate,
and 0.2-msec duration.
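
The arithmetic behind the averaged traces and the differential waveform of Fig. XV-9 can be sketched as follows. This is a minimal illustration of point-by-point averaging and subtraction only; it assumes that each response is stored as an equal-length list of samples aligned on the test shock, and it makes no claim about how the ARC-1 computer itself is organized. The function names are hypothetical.

```python
def average_response(trials):
    """Point-by-point mean of equal-length response records."""
    if not trials:
        raise ValueError("at least one response record is required")
    length = len(trials[0])
    if any(len(record) != length for record in trials):
        raise ValueError("all response records must have the same length")
    n = len(trials)
    return [sum(record[i] for record in trials) / n for i in range(length)]


def difference_waveform(average_with_light, average_control):
    """Trace B minus trace A: the change in the SERC produced by the light."""
    if len(average_with_light) != len(average_control):
        raise ValueError("averages must have the same length")
    return [b - a for b, a in zip(average_with_light, average_control)]
```

With 32 records per condition, as in the figures, deflections that are unchanged by the light cancel in the difference, while enhanced deflections remain.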

The  rate  of  presentation  of  the  light  is  important  in  determining  the  extent  of 
enhancement.  For  rates  of  1  per  second,  or  greater,  the  enhancement  is  small.  In 
several  cases  a  considerably  greater  enhancement  was  observed  for  the  rate  of  1  per 
4  seconds  than  for  the  rate  of  1  per  2  seconds.  The  enhancement  effect  is  obtained  for 
a  wide  range  of  light  intensities,  and  can  be  demonstrated  with  the  stimulating  light 
shining  directly  into  the  animal's  eyes,  or  with  the  light  overhead. 

In a number of experiments we have attempted to enhance the SERC by reticular
stimulation. In these midpontine pretrigeminal preparations such an enhancement
is possible, but difficult to obtain. The effect is extremely sensitive to
the location of the stimulating electrode in the reticular formation. When an
enhancement is obtained it occurs in the period following the onset of the train
of shocks to the reticular formation, and cannot be demonstrated during a
continuous reticular stimulation. This enhancement is greatest if the test shocks
are delivered to the optic pathway approximately 200 msec after the onset of
the train of shocks to the reticular formation. A 100-msec train of shocks to
the reticular formation seems to provide as much enhancement as a longer one.
Figure XV-10 shows enhancement of the SERC for shocks to the optic chiasm
and enhancement of the SERC to shocks to the optic radiations following
reticular stimulation (third waveform in each column). Note that in both cases the
size of deflection no. 1 is practically unchanged, while deflections no. 3, no. 4,
and no. 5 are enhanced.

Figure XV-10 also shows corresponding modification of the SERCs for shocks to
the  optic  chiasm  and  optic  radiations  following  the  onset  of  a  light  (second  waveform  in 
each  column).  For  shocks  delivered  to  the  optic  radiations  results  are  very  similar  to 
those  obtained  with  reticular  stimulation.  However,  for  shocks  delivered  to  the  optic 
chiasm  the  modification  is  more  complex;  deflection  no.  1  is  smaller  than  in  the  control, 
while  the  later  events  show  some  enhancement. 

We  interpret  these  results  as  follows:  Following  the  onset  or  termination  of  a  light 
there is a facilitation of the neurons of the primary visual area of the cortex. This
facilitation is demonstrated by the "test probe" of a shock to the optic radiations. There is
an enhancement of the intracortical events 3, 4, and 5 in the SERC when the test shock
is delivered in the appropriate time interval following the onset or termination of the
light. The modification of the SERC when the test shock is delivered to the chiasm
demonstrates an inhibitory effect which is caused by the retinal activity following the
onset of the light. (Arduini and his co-workers7 have hypothesized and
demonstrated that such an inhibition exists.) This inhibitory effect is evident in the decrease
of deflection no. 1 (see Fig. XV-10). The facilitation of the cortical neurons following
the flash is still evident in enhancement of the later deflections.

The mechanism of the enhancement effect is being investigated. Similarity of the
waveforms for enhancement following the light onset and following a reticular stimulation
suggests that the effects may involve the reticular formation. An important feature of
the enhancement is that it disappears completely following deep anesthetization by
Nembutal, which is consistent with this suggestion. However, there are
findings that cast doubt upon a reticular involvement. First, there is the difficulty
of obtaining the enhancement by reticular stimulation in the midpontine preparation,
contrasted with the constancy and ease of the demonstration of enhancement following onset
or termination of light. Second, the interval between light onset (or termination) and the
time of delivery of the test shock for greatest enhancement is considerably shorter than
the corresponding interval between the onset of the train of shocks to the reticular
formation and the test shock. Finally, it appears that the enhancement following reticular
stimulation is diminished more by a very light dosage of barbiturate anesthetic than the
enhancement produced by transients in the light. None of these findings provides
conclusive evidence one way or the other.

Further studies include a detailed investigation of the enhancement as a function of
the  time  between  the  light  onset  (or  termination)  and  delivery  of  the  test  shock.  The 
question  whether  the  facilitation  of  cortical  neurons  is  limited  to  the  primary  visual 
area  is  also  being  investigated. 

A.  Cavaggioni,  M.  H.  Goldstein,  Jr.,  Eleanor  K.  Chance 

E. STUDY OF THE HANDWRITING MOVEMENT

An experimental system to investigate the detailed properties of pen displacement
and velocity as a function of time during handwriting has been put into operation. A
commercial handwriting transmitter (Telautograph) has been modified so that its
output signals, voltages from two precision potentiometers mechanically coupled to the pen,
may be sampled, converted to digital form, and written on magnetic tape by the
TX-0 computer. The analysis of these data is carried out by the IBM 7090 computer of
the Computation Center, M.I.T.

Data analysis proceeds as follows. The x and y coordinate sequences are first
recovered by solving the equations of mechanical constraint of the pen and then
differentiated to yield the pen velocity vector function. A segmentation into strokes is carried
out by using the zeros of the y velocity as segmentation points. A parametric
description of the strokes thus obtained is determined on the basis of the following models.
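
The differentiation and segmentation steps can be sketched as follows. This is an illustrative sketch only: the finite-difference scheme, the treatment of samples that are exactly zero, and the function names are assumptions, since the report does not describe how the zeros of the y velocity were located numerically.

```python
def differentiate(samples, dt):
    """Velocity by central differences, with one-sided differences at the ends."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples")
    velocity = [(samples[1] - samples[0]) / dt]
    for i in range(1, n - 1):
        velocity.append((samples[i + 1] - samples[i - 1]) / (2.0 * dt))
    velocity.append((samples[-1] - samples[-2]) / dt)
    return velocity


def zero_crossings(velocity):
    """Indices at which the velocity reaches zero or changes sign."""
    points = []
    for i in range(1, len(velocity)):
        if velocity[i - 1] == 0.0:
            continue  # already counted when the zero was first reached
        if velocity[i] == 0.0 or (velocity[i - 1] > 0) != (velocity[i] > 0):
            points.append(i)
    return points


def segment_strokes(y, dt):
    """Split a y-coordinate record into strokes at the zeros of the y velocity.

    Returns (start_index, end_index) pairs; adjacent strokes share an end point.
    """
    cuts = zero_crossings(differentiate(y, dt))
    bounds = [0] + cuts + [len(y) - 1]
    return [(a, b) for a, b in zip(bounds, bounds[1:]) if b > a]
```

For a record that rises, falls, and rises again, `segment_strokes` returns three index ranges, one per stroke.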

1.  Model  A 

This model matches half-wave sinusoidal segments $\dot{x}_m(t)$, $\dot{y}_m(t)$ to the
experimentally obtained functions $\dot{x}(t)$ and $\dot{y}(t)$. It corresponds closely to
that of Eden1 with the exception that his slope parameter $\sigma$ has been replaced by a
phase-shift parameter $\phi$ for mathematical convenience.

Approximating Equations

$$\dot{x}_m(t) = X \sin[\omega(t - t_0) + \phi] + X_c, \qquad t_0 < t < t_2$$

$$\dot{y}_m(t) = Y \sin \omega(t - t_0), \qquad t_0 < t < t_2$$

Constraints

$$\omega = \pi/(t_2 - t_0) \tag{1}$$

$$\dot{x}_m(t_0) = \dot{x}(t_0) \tag{2}$$

$$\dot{x}_m(t_2) = \dot{x}(t_2) \tag{3}$$

$$\int_{t_0}^{t_2} \dot{x}_m(t)\,dt = \int_{t_0}^{t_2} \dot{x}(t)\,dt \tag{4}$$

$$\int_{t_0}^{t_2} \dot{y}_m(t)\,dt = \int_{t_0}^{t_2} \dot{y}(t)\,dt \tag{5}$$

where $t_0$ and $t_2$ are the experimentally determined segmentation points.
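
As an illustration, the five constraints of Model A can be solved in closed form for the five parameters. The sketch below assumes the constraints read as follows: the frequency is fixed by the stroke duration, the model x velocity equals the measured x velocity at both segmentation points, and the model and measured velocities have equal integrals over the stroke (that is, equal net x and y displacements). The closed-form solution and all names are a derivation offered here, not a procedure taken from the report.

```python
import math


def fit_model_a(t0, t2, xdot_t0, xdot_t2, dx, dy):
    """Solve the Model A constraints for X, Xc, Y, omega, and phi.

    xdot_t0, xdot_t2: measured x velocity at the segmentation points.
    dx, dy: net measured displacement over the stroke (integrals of velocity).
    """
    duration = t2 - t0
    omega = math.pi / duration                 # half a cycle spans the stroke
    x_c = 0.5 * (xdot_t0 + xdot_t2)            # sum of the two end-point conditions
    x_sin = 0.5 * (xdot_t0 - xdot_t2)          # X sin(phi), from their difference
    x_cos = 0.5 * omega * (dx - x_c * duration)  # X cos(phi), from the x integral
    return {
        "omega": omega,
        "X": math.hypot(x_sin, x_cos),
        "phi": math.atan2(x_sin, x_cos),
        "Xc": x_c,
        "Y": 0.5 * omega * dy,                 # from the y integral, 2Y/omega = dy
    }


def model_a_velocity(params, t0, t):
    """Evaluate the model velocities (x, y) at time t within the stroke."""
    phase = params["omega"] * (t - t0)
    xdot = params["X"] * math.sin(phase + params["phi"]) + params["Xc"]
    ydot = params["Y"] * math.sin(phase)
    return xdot, ydot
```

Regenerating a stroke then amounts to evaluating `model_a_velocity` over the interval and integrating the result.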

2.  Model  B 

This model is an extension of Model A which allows for differences between the
acceleration and deceleration sections of any stroke. The x-velocity amplitude, the
frequency, and the phase shift between x and y velocities are allowed to take on
different values over the two ranges.

Approximating Equations

$$\dot{x}_m(t) = X_1 \sin[\omega_1(t - t_0) + \phi_1] + X_c, \qquad t_0 < t \le t_1$$

$$\dot{x}_m(t) = X_2 \sin[\omega_2(t_2 - t) - \phi_2] + X_c, \qquad t_1 < t \le t_2$$

$$\dot{y}_m(t) = Y \sin[\omega_1(t - t_0)], \qquad t_0 < t \le t_1$$

$$\dot{y}_m(t) = Y \sin[\omega_2(t_2 - t)], \qquad t_1 < t \le t_2$$

Constraints

Equations 2-5 of Model A and

$$\omega_1 = \frac{\pi}{2(t_1 - t_0)}$$

$$\omega_2 = \frac{\pi}{2(t_2 - t_1)}$$

$$X_1 \cos \phi_1 = X_2 \cos \phi_2 \qquad \text{(continuity at } t = t_1\text{)}$$

$$\omega_1 X_1 \sin \phi_1 = \omega_2 X_2 \sin \phi_2 \qquad \text{(continuous derivative at } t = t_1\text{)},$$

where $t_0$ and $t_2$ are experimentally determined segmentation points, as above, and
$t_1$ is mathematically unconstrained but experimentally determined by minimizing the
rms error between the experimental and matching functions.

3.  Discussion 

Writing samples produced by two input methods were obtained from one subject and
processed. The first utilized the handwriting transmitter mentioned above in which the
writing is executed on a horizontal surface (Input 1). In the second the writing is
executed on the vertical face of the TX-0 computer oscilloscope (Input 2). The rms error
ratios (rms error/rms signal) have been calculated to obtain an indication of the
matching accuracy for both velocity and displacement functions and are given in
Table XV-1. The extension of these results to include data from different subjects is
under way at the present time.
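
The error measure quoted above can be written out as a short function. This is a sketch that assumes "rms signal" refers to the rms value of the experimental (measured) function and that the measured and model functions are sampled at the same instants; the function names are hypothetical.

```python
import math


def rms(values):
    """Root-mean-square value of a sequence of samples."""
    if not values:
        raise ValueError("need at least one sample")
    return math.sqrt(sum(v * v for v in values) / len(values))


def rms_error_ratio(measured, model):
    """rms of (measured - model) divided by rms of the measured signal."""
    if len(measured) != len(model):
        raise ValueError("sequences must have the same length")
    error = [m - a for m, a in zip(measured, model)]
    return rms(error) / rms(measured)
```

The same function applies to velocity samples and to displacement samples, which is how the two rows for each model in Table XV-1 would be compared.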


Table XV-1. Matching accuracy of handwriting models (rms error/rms signal).

Model   Error          Input 1(a)   Input 2(b)
A       Displacement   .047         .046
A       Velocity       .232         .112
B       Displacement   .039         .045
B       Velocity       .175         .088

(a) Average of 9 samples
(b) Average of 4 samples

It is immediately apparent that a relatively poor match in the velocity domain will,
when integrated, result in a quite acceptable match in the displacement domain. Hence,
for Input 1, the input method of major interest, while Model A can be considered
adequate in the displacement domain, it appears inadequate for the study of the detailed
intrastroke kinematics that are observable only in the velocity domain. Model B clearly
effects a significant improvement for the velocity error over Model A. The differences
between the two models for Input 2, for which the major part of the writing movement
is executed by the elbow and the upper arm rather than the wrist and fingers, are much
reduced and thus indicate that the kinematics of the segments of positive and negative
acceleration within any stroke are much more alike. The smaller errors obtained for
Input 2 indicate that this mode of writing is smoother and corresponds more closely to
a harmonic oscillator system.

For Model A five parameters [$X$, $X_c$, $Y$, $\omega$, $\phi$] and for Model B eight parameters
[$X_1$, $X_2$, $X_c$, $Y$, $\omega_1$, $\omega_2$, $\phi_1$, $\phi_2$] are required to be known for the
regeneration of any stroke. In each case the validity of the approximation is revealed by
the close match between the original and regenerated two-dimensional handwriting patterns.
The representations also achieve a reduction in the information storage per word over
function samples at the minimum sampling frequency for perfect signal recovery if the
velocity function is considered bandlimited at 20 cps.

The usefulness of the models for recognition work is primarily determined by the
statistical properties of the above-given parameters. Experimental studies of the
variations in stroke parameters with writer and context, as well as their consistency in the
absence of such variations, are now being carried out. It is desired to determine the
conditional probability densities $p(v_j/s_i)$, where $v_j$ is the jth parameter and $s_i$ is the
ith stroke group, and strokes are grouped according to topological similarities. If the
information content of the parameters is sufficiently high, that is, if
$p(v_j/s_k) \gg p(v_j/s_i)$ for all $k \neq i$, then reliable recognition of which stroke group
the sample belongs to may be expected. The strong statistical constraints in sequences of
particular strokes are to be used in the final identification of the individual letters.
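
One way such conditional densities could be used is a maximum-likelihood assignment of a stroke to a stroke group. The sketch below is purely illustrative: it assumes that each parameter is Gaussian within a group and that the parameters are independent, neither of which is claimed in the report, and the group names and statistics in the usage note are invented for the example.

```python
import math


def gaussian_log_density(value, mean, std):
    """Logarithm of a one-dimensional Gaussian density."""
    z = (value - mean) / std
    return -0.5 * math.log(2.0 * math.pi * std * std) - 0.5 * z * z


def classify_stroke(parameters, group_models):
    """Return the stroke group whose densities make the parameters most probable.

    parameters: list of measured stroke parameters v_j.
    group_models: dict mapping group name -> list of (mean, std), one per parameter.
    """
    best_group, best_score = None, -math.inf
    for group, stats in group_models.items():
        if len(stats) != len(parameters):
            raise ValueError("one (mean, std) pair is needed per parameter")
        score = sum(gaussian_log_density(v, m, s)
                    for v, (m, s) in zip(parameters, stats))
        if score > best_score:
            best_group, best_score = group, score
    return best_group
```

For example, with hypothetical groups `{"loop": [(1.0, 0.2), (0.5, 0.1)], "hook": [(3.0, 0.2), (-0.5, 0.1)]}`, the parameter list `[1.1, 0.45]` is assigned to `"loop"`. Recognition is reliable in the sense of the text when the winning density exceeds all others by a wide margin.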

P.  Mermelstein 


References 

1. M. Eden, Handwriting and pattern recognition, IRE Trans., Vol. IT-8, pp. 160-166, 1962.

F.  STROKE  ANALYSIS  OF  DEVANAGARI  CHARACTERS 

Recent studies1 of the characteristics of English cursive handwriting have raised
the question whether writing systems unrelated to English might be a useful
complementary study. For this purpose, the Devanagari script of India offers interesting
possibilities because in its present form it has been fairly static for more than a
thousand years, despite the development of closely related scripts, such as Bengali and
Gujarati, which are much faster to write.

It is only in the past hundred years that mechanical methods of writing have become
widespread in the West; and during the past decade strenuous efforts have been made to
develop a satisfactory typewriter for Indian languages. Meanwhile, typesetting
equipment that can produce aesthetically satisfying printed material is also being sought. At
the present time, only hand-set type fonts making use of 400-800 pieces of type come
close to looking authentic, and such useful devices as linotype have basic structural
limitations that prevent them from accommodating such a variety of symbols.

Devanagari  writing  (used  for  Sanskrit,  Hindi,  Marathi,  Punjabi,  and  in  modified 
form  for  Gujarati  and  Bengali)  is  syllabic.  That  is,  the  unit  of  writing  is  the  consonant 
(or  consonant  cluster)  combined  with  the  following  vowel,  which  is  written  to  the  left, 
right,  above  or  below.  This  combination  is  called  an  akshar.  For  example, 

^f-*-£+T+'T=>  r*"V  6i*-C+n*' 

Vowels  have  a  special  written  form  when  they  appear  without  a  preceding  consonant. 

1. Research Objectives

The aim of this research, then, is to study the structure of the Devanagari symbols
with a view toward understanding whether and how the consonants and consonant clusters
(a brief survey uncovered 221 distinct clusters) can be generated, and described,
distinguished or selected, in a consistent manner.

A consonant (or cluster) appearing alone is read as containing the "intrinsic" vowel,
/a/.  Frequency  studies  in  Hindi  and  Marathi  show  that  the  most  frequent  occurrences 
are  of  a  single  consonant  with  /a/,  and  therefore  we  shall  study  the  writing  of  the  33 
Devanagari  consonants  first. 

In  this  report  a  stroke  analysis  appropriate  to  Devanagari  is  presented,  together 
with  the  joining  rules  for  normal  writing.  These  rules  form  a  complete  set  with  respect 
to generation of the consonants, as shown in Fig. XV-11.

2.  Development  of  the  Set  of  Basic  Strokes 

In contrast to English cursive script, penlifts occur in Devanagari even in the writing
of a single "letter." Strokes are written from left to right and from top to bottom. There
are very few exceptions. It is possible to develop most of the strokes by applying two
kinds of transformations to a basic Straight Stroke /A/ and Curved Stroke /B/.
We call the operator Reflection (about a vertical), $\alpha$. We call the operator Rotation, $\rho$
(Rotation by 45°, $\rho_1$; Rotation by 90°, $\rho_2$). Applying $\rho_1$ and $\rho_2$ to /A/, we get

$A$, $\rho_1 A$, $\rho_2 A$, $\rho_1\rho_2 A$, $\rho_2^2 A$, $\rho_1\rho_2^2 A$, $\rho_2^3 A$

Of these, only the first four occur. They are labelled (for convenience, and to avoid
subscripts): /A/, /D/, /L/, /Q/. Applying $\alpha$ and $\rho_2$ to /B/, we get

[Drawings of the reflected and rotated forms of /B/.]

Of these, all occur except the last. They are labelled (again, for convenience): /B/,
/E/, /J/, /N/, /h/, /M/, /K/. Two other strokes, /f/ and /g/, are reflections
of each other. Two miscellaneous strokes, /C/ and /P/, complete the set of 15
Basic Strokes.

The stroke sizes have been quantized to two levels. For the larger, we use
upper-case labels; and for the smaller, we use lower-case labels. (Concerning the third Basic
Stroke see II (iii) below.)

[Stroke drawings with their upper- and lower-case labels.]

For  three  strokes,  both  large  and  small  sizes  occur.  In  all  other  cases  only  the 
smaller  size  is  utilized. 

A distinctive feature of Devanagari is the top line that "ties" together the character
(and is written last). Certain characters have a broken top line, and for these /(r)/ is
added to the linear-form description of the character.

One other symbol, /s/, is used to signify a penlift. Where this is redundant
information (the penlift being due to the direction in which the previous stroke was written),
/// is used.

3.  General  Comments  on  Characteristic  Features  of  the  Strokes 

We shall list some characteristic features of the strokes in a purely descriptive
manner. Then we shall discuss some cases in which contextual clues are relevant to
the role of the individual stroke in forming the characters. Relatively simple rules
suffice to indicate whether the larger or smaller size of a given stroke is to be used in a
given situation. Thus the same code could unambiguously apply to both. Similarly, in
certain contexts a variant on the usual form (or shape) of a stroke is used. To simplify
reading, we have retained the lower- and upper-case designations in the "linear forms"
given in Fig. XV-11.

I.  Characteristic  Features  of  the  Strokes 

(i)  Written  from  top  to  bottom:  A  B  c  d  F  g  h 

Written  from  left  to  right:  d  e  1  p  q 

Written  from  right  to  left:  c  F  m  n 

Written  from  bottom  to  top:  J  k  q 

(ii) Initial movement downward: A c d e m p
Initial  movement  to  right:  d  g  h  J  p  q 

Initial  movement  to  left:  B  F  k 

Initial  movement  upward:  n  q 

(iii)  Requiring  counterclockwise  movement:  B  e  f  j  m  p 

Requiring  clockwise  movement:  c  g  h  n  k 

(iv)  Positioning  in  letter  space:  Height 

Always  found  at  same  distance  below  top  line:  AaBcFGJklmp 
Height  in  character  can  vary:  b  d  e  h  n  q 

The  rarity  of  strokes  written  leftward  or  upward,  as  indicated  in  I  (ii),  is  even  more 
striking  when  the  relative  frequency  of  these  strokes  is  taken  into  consideration. 

The data of I (iv) indicate a high degree of redundancy, in that most strokes are not
found except in certain regions of the letter space, and may perhaps explain the
readability of the script despite its complexity.

II.  Notes  on  Certain  Strokes 

(i)  /A, a/  Both  large  and  small  forms  are  always  written  from  the  top  line 
As  an  initial  stroke,  it  occurs  only  as  /a/ 

As  medial  or  final  stroke,  it  occurs  only  as  /A/ 

It  is  interesting  to  note  that  none  of  the  characters  has  a  vertical  line  segment  except 
that  hanging  from  the  top  line. 

(ii) /B, b/ As an initial stroke this occurs only as /b/, starts from the top line

only  in  sequence  /bb. .  /.  Otherwise  it  is  written  in  the  middle  of  the 
letter  space. 

In  medial  position  the  form  is  /b/  in  context  /abh/;  otherwise  B  as  in 
aB,  aBJ,  aBn,  aBjc. 

(iii)  /c/  This  stroke  has  two  forms.  It  is  written  (J  )  when  followed  by  /l/; 

otherwise  it  is  written  ( J  ). 

(iv)  /e/  In  context  / le . .  /  the  form  is  (  O)  instead  of  the  normal  ( <-> ). 

(v)  /J.j/  In  context  /aBJ/  the  form  is  «b);  i.e.,  /j/.  Otherwise  (  »  ),  /j/. 

(vi)  /l/  Always  written  at  mid-height  in  letter  space.  The  group  /.  .1A. ./  is 
frequent  and  is  uniformly  written  H  . 

4.  General  Rules  of  Formation  of  Characters  from  Strings  of  Strokes 

The  rules  presented  below  may  be  considered  a  preliminary  set.  As  formulated  at 
present,  they  deal  with  the  construction  of  the  consonants.  They  have  been  left  flexible, 
however, in order to accommodate the numerous and complex cases of conjunct
characters. A significant feature of the rules is that transitional strokes are obviated except
with /A/.

Fig. XV-11. The 33 Devanagari consonants and their
"linear" representation.

III. General Rules

(i)  The  highest  point  of  the  first  stroke  is  always  at  the  top  line,  except  that  initial 
/b/»  /F/»  M>  /*/»  /m/  are  written  at  mid-height.  (Note  that  initial  /bb. ./  is 
written  from  the  top  line.) 

(ii) Strokes always commence from the end point of the previous stroke except as
in (v) below.

(iii) Mark /// implies a penlift that is due to the direction of stroke, and is a
redundant instruction. Mark /s/ implies a penlift that is due to the fact that the
character has two segments, and implies a space left between the segments.

(iv)  After  the  listed  strokes  comes  the  conventional  top  line.  In  cases  in  which  the 
additional  instruction  /(r)/  is  added,  it  implies  that  the  top  line  is  written  over 
the  right  half  of  the  character  only. 

(v) Added transitional strokes are used before and after /A/ as follows:

[Diagrams of the transitional strokes added before and after /A/.]


IV.  Examples  of  Operation  of  the  Rules 

/b:  *  /bb:  £  /bA/:  4  /bbq:  £/  /bbqn:  £?  /bbqnc:  £$  /bbqnc/:  £% 

In  the  first  example,  *  implies  uncertainty  till  the  sequence  is  known. 

5. Remarks


The  work  reported  here  provides  a  means  of  uniquely  generating  the  33  consonants 
of  the  Devanagari  alphabet  from  a  set  of  15  Basic  Strokes.  In  Fig.  XV-11  the  stroke 
descriptions, which are essentially linear representations of two-dimensional characters,
are underlined to the point where unambiguous identification is possible.
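
The idea of underlining each stroke description "to the point where unambiguous identification is possible" can be sketched as a shortest-unique-prefix search. The following is only an illustration under stated assumptions: the stroke strings and the labels C1 to C4 are hypothetical and are not the descriptions of Fig. XV-11, and the function name is my own.

```python
def shortest_unique_prefixes(stroke_strings):
    """Map each label to the shortest prefix of its stroke string shared by no other string."""
    result = {}
    for label, strokes in stroke_strings.items():
        others = [s for name, s in stroke_strings.items() if name != label]
        # Grow the prefix one stroke at a time until no other string starts with it.
        for length in range(1, len(strokes) + 1):
            prefix = strokes[:length]
            if not any(other.startswith(prefix) for other in others):
                result[label] = prefix
                break
        else:
            # The whole string is itself a prefix of another string: never unambiguous.
            result[label] = None
    return result


# Hypothetical stroke strings (one letter per stroke), NOT taken from Fig. XV-11.
example = {"C1": "bbqnc", "C2": "bbqd", "C3": "bA", "C4": "cq"}
prefixes = shortest_unique_prefixes(example)
```

With these made-up strings, C1 is not identified until its fourth stroke, because its first three strokes are shared with C2, whereas C4 is identified by its first stroke alone.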

This work can be logically extended to the study of akshar formation and the adaptation
of the present strokes and rules to this far more complex situation. In another direction
lies the comparison of the script described in this report, as used in Sanskrit, Hindi,
Marathi, and Punjabi, with related scripts. Gujarati on the West coast of India, and Bengali
in the East, have developed written alphabets that are obviously offshoots of Sanskrit-Devanagari.
The study of these scripts in relation to their parent writing form is not only
of anthropological interest but also provides an opportunity for a comparative study of
the cognitive roles of the constituent strokes in the related-but-different formations.

J.  G.  Krishnayya 
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A.  THE  EFFECT  OF  PENTOBARBITAL  ON  THE  VISUAL  SYSTEM 

OF  RATS 

The  optic  nerves  of  albino  rats  anesthetized  with  sodium  pentobarbital  (50  mg/kg) 
were  exposed  by  removing  the  cerebrum  lying  over  the  nerves  by  suction.  In  such  a 
preparation,  the  circulation  in  the  retina  and  optic  nerve  remains  intact.  The  dura 
and  nerve  sheath  were  removed  from  the  nerve  between  the  optic  foramen  and  the 
optic chiasm and single units were recorded by means of metal-filled glass micropipettes,
plated with platinum black.1 Single units can be stably recorded for one
hour and longer; moreover, units recorded in sequence along the track of the electrode
can be recovered upon withdrawal of the electrode in the reverse sequence.

In  such  a  preparation  under  pentobarbital  the  most  striking  activity  recorded  is  a 
synchronous,  spontaneous  bursting  of  most  of  the  active  units  at  approximately  15  cps 
(Figs. XVI-1 and XVI-2). Almost all such units respond with a burst of spikes to the
"off" of the general illumination, and are desynchronized when the general illumination
is turned on (Fig. XVI-2). This bursting "off" unit activity is quite well synchronized
through the nerve; a 5-micron KCl pipette, placed on the surface of the nerve, records
a periodic potential of approximately 15-cps frequency (Fig. XVI-3).

The bursting "off" unit activity is not present in the optic nerve axons of an animal
prepared without any barbiturate; these animals were prepared under ether anesthesia.
A midcollicular section was performed, followed by removal by suction of the cerebrum
overlying the optic nerve. If the optic nerve in such an unanesthetized animal is crushed
close to the optic chiasm and records taken distal to the crush, no such bursting activity
is seen. This suggests that the bursting activity is not due to suppression of some efferent
activity by the barbiturate. Furthermore, an electrode records many more units in
the unanesthetized animal, many of which are "on" units. If the sequence and number of
units along an electrode track are noted and Nembutal administered, as the level of anesthesia
deepens and the electrode is withdrawn, the nerve is found to be relatively much
more quiet, with groups of bursting "off" units not before observed now being recorded,
which have quiet regions between them. Conversely, as the level of anesthesia lightens,
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Fig.  XVI- 1.  A  single  unit  of  the  "bursting  off  unit"  variety 
recorded  in  the  optic  nerve  of  an  albino  rat 
under  pentobarbital.  Time  marker:  50  per 
second. 




Fig.  XVI-2.  Several  bursting  units  recorded  in  the  optic 
nerve  of  an  albino  rat  under  pentobarbital. 
The  background  illumination  was  turned  on 
in  the  middle  of  the  sweep  and  off  at  the  end. 
Time  marker:  2  per  second. 


Fig.  XVI-3.  Slow  potentials  recorded  with  a  gross  electrode 
from  the  surface  of  the  optic  nerve  of  an  albino 
rat  under  pentobarbital.  Time  marker:  4  per 
second. 
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the  bursting  activity  diminishes  and  types  of  units  other  than  the  bursting  "off"  units 
are  recorded. 

This  bursting  "off"  unit  activity  was  also  seen  in  albino  rats  under  allobarbitone 
(Dial,  Ciba),  and  in  both  albino  and  Long-Evans  hooded  rats  under  pentobarbital  with 
the  recordings  taken  stereotaxically  with  the  same  type  of  low-impedance  electrodes. 

J.  E.  Brown,  J.  A.  Rojas 
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B.  SINGLE-UNIT  RESPONSES  IN  THE  CEREBELLUM  OF  THE  FROG 

The cerebellum of the frog is a bean-shaped organ lying on the dorsal side of the
brain between the medulla and the tectal lobes. The largest nerve-fiber bundles entering
the cerebellum are the (dorsal) spinal tract and the vestibular tract. Small tracts
course in from nerves IV and V, from the tectum, and from sundry nuclei in the medulla.
The histological structure of the organ upon which these fiber tracts of different sensory
modalities converge is similar to, but less complex than, that of the mammal. That is,
according to the scanty and sketchy anatomy that we find (in the frog) there are the three
distinct layers, namely the molecular, the granular, and the Purkinje cell layers, but their
arrangement and interconnection are less elaborate than in the mammal.1,2

The  present  study  attempts  to  reveal  through  microelectrode  recording  how  each 
modality  is  handled  separately  and  in  combination  by  short-term  responses  in  cells  of 
the cerebellum. The recording was done with metal-filled, platinum-plated, glass electrodes
on curarized (in several cases, uncurarized) frogs (R. pipiens). I distinguished
the  stimuli  applied  to  the  frog  mainly  by  general  qualitative  differences. 

In  harmony  with  the  nature  of  the  stimuli,  this  report  is  a  qualitative  description  of 
the  behavior  of  cerebellar  cells  in  the  frog.  This  description  is  not  exhaustive  and  deals 
only  with  those  cells  in  which  the  response  was  a  clear-cut  change  in  activity.  It  will 
be  presented  as  a  set  of  categories  representing  the  nature  of  the  applied  stimuli  — 
vestibular,  mechanical,  visual,  and  combinations  of  these  three. 

Some  of  the  vestibular  organs  are  simultaneously  stimulated  by  a  rotation  of  the  frog 
about  an  axis  inclined  to  the  horizontal.  Cells  that  respond  to  this  mode  of  stimulation 
are  of  the  following  types: 

(i)  One  kind  has  a  periodic  resting  rate,  responds  to  rotation  both  clockwise  and 
anticlockwise  with  a  continuous  barrage,  the  frequency  of  which  depends  on  the  velocity 
of  rotation.  These  resting  rates  change  with  changes  in  the  position  of  the  head. 

(ii)  Another  kind  usually  has  an  aperiodic,  bursty,  resting  rate  and  responds  to 
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rotation  in  one  direction  with  a  burst  whose  frequency  depends  on  the  magnitude  of  the 
acceleration. It is inhibited by, or responds only weakly to, a rotation in the other direction.
Such a cell shows a rapid adaptation to successive rotatory (back and forth) stimuli.
It usually does not show nicely periodic changes in resting rate with changes in
position  of  the  head;  although,  in  several  cases,  a  change  in  position  did  produce  a 
quickening  of  the  rate  (Fig.  XVI-4). 

(iii)  Another  kind  is  (a)  silent  and  responds  to  a  rotation  in  one  direction  with 
an  abrupt  burst  of  five  or  more  spikes  whose  frequency  is  proportional  to  the  rate  of 
change  of  acceleration,  or  (b)  is  silent  and  responds  with  a  spike  or  two  at  the  end  of 
rotation.  Both  (a)  and  (b)  accommodate  to  a  second  stimulus  presented  with  a  period  of 
10  seconds. 

In  all  of  the  above-mentioned  cells  possessing  a  resting  rate,  that  resting  rate  is 
depressed  after  a  response  but  slowly  returns  after  a  period  of  at  least  10  seconds 
(often  1  minute).  The  length  of  this  period  depends  on  the  strength  and  the  duration 
of  the  stimulus. 

Mechanical  stimulation  is  performed  on  the  limbs,  and  to  a  lesser  extent  on  other 
parts  of  the  body  —  neck,  back,  snout,  eye  muscles,  cornea.  Cells  that  respond  to 
this  mode  of  stimulation  are  of  the  following  types: 

(i)  One  has  usually  an  aperiodic  resting  rate,  responds  with  a  burst  to  changes 
in  limb  position,  and  rapidly  and  completely  adapts  to  a  second  stimulus  for  a  period 
of  at  least  10  seconds.  These  cells  are  sensitive  to  at  least  one  and  perhaps  several 
muscles  acting  about  a  particular  joint  in  the  limb,  and  often  will  respond  to  the  same 
stimulus  in  both  left  and  right  limbs  (Fig.  XVI-5). 

(ii)  Another  has  a  periodic  resting  rate  and  responds  with  a  burst  to  any  one  of 
the  following  stimuli,  the  choice  depending  on  the  cell:  Pressing  on  the  limb  pads  (of 
sometimes  one,  two,  or  all  four  limbs),  pressing  on  the  eye  muscles,  and  blowing  and 
brushing on the cornea and various parts of the skin. These stimuli depress the periodic
resting rate after a response.

(iii) A third is silent and gives a periodic response whose frequency is proportional
to the amount of bend of a particular joint in one limb. These responses are
nonadaptive and may reach a high firing rate of several hundred spikes per second, and
maintain this rate for as long as the joint is bent (Fig. XVI-6).

The only other sensory modality seen thus far to affect some cells of the frog cerebellum
is vision. These cells have an aperiodic resting rate and respond with an abrupt
burst  to  the  movement  of  an  object  (for  example,  a  hand)  within  the  visual  field  of  an 
eye.  They  adapt  after  a  single  stimulus,  and  will  not  respond  again  to  this  stimulus 
for  at  least  15  seconds.  They  thus  have  the  same  properties  as  the  'newness'  neurons 
seen  by  Lettvin  and  others  in  the  optic  tectum.3 

Interactions  between  the  various  modalities  occur  in  the  cells  described  above. 
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Fig.  XVI-4.  Vestibular  type  ii  cell.  Onset  of  stimulus  indicated  by 
an  arrow  beneath  the  trace.  Spike  height  ~1.5  mv. 

a.  resting  rate  (head  in  normal  position). 

b. rotation 90° counterclockwise.

c. resting rate in new position.

d. rotation 90° clockwise.

e. initial resting rate.

f. time base, 10/sec.
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Fig.  XVI-5.  Mechanical  type  i  cell.  Spike  height  ~2  mv. 

a.  resting  rate. 

b.  right  leg  picked  up  and  put  down  (up  and 
down  indicated  by  arrows). 

c.  resting  rate. 

d.  left  leg  picked  up  and  put  down. 

e.  time  base,  10/sec. 
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These  cells  behave  as  mixtures  of  the  following  kinds: 

(a)  Vestibular  1  and  Mechanical  2.  These  cells  have  a  periodic  resting  rate  and 
will  respond  to  rotation  and  change  their  resting  rate  with  changes  in  the  position  of 
the  head,  will  respond,  for  example,  with  a  burst  to  pressing  on  left  or  right  toe  pad, 
and  will  have  their  resting  rate  depressed  immediately  thereafter  (Fig.  XVI-7).  Similar 
responses  are  elicited  in  other  cells  with  other  mechanical  stimuli. 

(b)  Vestibular  2  and  Mechanical  1.  These  cells  have  an  aperiodic  resting  rate  and 
will  respond  with  a  burst  to  either  rotation  or  movement  of  one  or  several  limbs, 
and  will  have  their  resting  rate  depressed  immediately  thereafter. 

(c)  Vestibular  2  and  Visual.  These  behave  as  those  in  (b),  except  that  vision  rather 
than  mechanical  stimulation  causes  response.  Visual  responses  have  been  seen  in  only 
this  type  of  cell. 

No  attempt  has  yet  been  made  at  strict  anatomical  localization  of  the  above-mentioned 
groups  of  cells  with  respect  to  the  layers  of  the  cerebellum.  Physiologically,  in 
penetrating  the  dorsal  surface  of  the  cerebellum,  the  electrode,  inclined  caudally  at  a 
very slight angle to the vertical, encounters three distinct layers consisting of the following
groups of cells:

(i)  vestibular  2,  vestibular  3,  mechanical  1,  visual 

(ii)  vestibular  1,  mechanical  2 

(iii)  mechanical  3. 

The  time  constants  of  the  cells  in  these  three  layers  are  also  appreciably  different. 

The  height  of  the  potentials  recorded  in  the  first  layer  is  invariably  greater  than  that 
of  those  in  the  other  two,  and  the  cells  in  this  layer  are  electrically  closely  packed. 
These cells are probably the Purkinje cells, which are the largest (~10 μ in diameter)
and  the  most  closely  packed  cells  in  the  cerebellum. 

With  the  exception  of  vestibular  3  responses,  the  short-term  response  pattern  of 
the  cerebellar  cells  is  not  appreciably  different  from  that  found  in  the  fiber  tracts 
entering  the  cerebellum.  In  fact,  the  response  in  mechanical  3  seems  no  different 
from  what  would  be  expected  in  a  tendon-stretch  receptor  itself,  unless,  indeed,  this 
response  is  combinatorial  with  other  stretch  receptors  in  the  limb  (as  is  perhaps  the 
case  in  Fig.  XVI-6).  The  differences  are  found  in  the  combination  of  the  responses  of 
various  modalities  and  in  that  of  the  responses  from  several  limbs  within  a  single  cell. 

The  most  patent  use  to  the  frog  of  the  short-term  responses  of  cerebellar  cells  is 
in  the  rapid  control  of  efferents  to  muscle.  For  example,  Granit,  Holmgren,  and 
Merton4 have shown that in the cat the cerebellum in some way regulates the proportion
of the two routes of muscle excitation, α and γ, in use at any time. That there is a
large representation of the γ efferents in the cerebellum is seen in the large decrease
in  cellular  activity  in  layer  ii  when  a  frog  is  curarized.  (Curare  knocks  out  muscle 
spindle  activity.)  The  proportion  of  the  two  routes  in  use  in  a  given  muscle  at  any  time 
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is  very  likely  dependent  on  vestibular,  dermal,  visual,  and  other  muscular  activity. 

The  nature  of  these  short-term  responses  allows  the  cerebellum  to  perform  just  such 
a  regulation. 

The  long-term  responses  of  the  cerebellar  cells  consist  in  the  setting  and  modulation 
of  the  complex  resting  rates  (not  strictly  aperiodic  but  almost  cyclic  in  the  repetition 
of  a  bursting  pattern)  of  those  cells  by  previous  vestibular,  mechanical,  and  perhaps 
visual  stimuli.  These  kinds  of  responses  are  suited  to  making  the  cerebellum  in  part 
an organ that can 'remember' previous trajectories of a frog in its milieu, and that can
plot  intended  trajectories  of  the  animal.  This  bit  of  speculation  will  be  the  object  of 
further  study  on  the  cerebellum  of  the  frog. 

F.  S.  Axelrod 
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A.  BLACK-BOX  DESCRIPTION  AND  PHYSICAL  ELEMENT  IDENTIFICATION 
IN  THE  PUPIL  SYSTEM 

Black-box input-output analysis of nonlinear systems is, at present, a control-systems


Fig. XVII-1. First- and second-order kernels of open-loop pupil light servomechanism.
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area  of  much  activity.  Functional  analysis1  4  has  a  mathematical  elegance  that  is 

attractive, and we have applied this method to the pupil light reflex.5 Figure XVII-1
shows h1(τ), the first-order kernel, and h2(τ1,τ2), the second-order kernel, as two-dimensional
and three-dimensional functions, respectively. Work is now under way to
refine the experimental method to obtain more consistent and reliable data.

However, since our goal is identification of the physical laws of the anatomical-physiological
elements that together make up the pupillary system, we have attempted
to dissect into the black box. The high-frequency cutoff is probably due to the output
mechanical elements, the iris muscles, as the experiment illustrated in Fig. XVII-2
demonstrates.6 Here, locally applied drugs that partly overstimulate both the sphincter
and dilator muscles reduce the bandpass of the system and the gain. This confirms
a previous experiment7 in which gain measurements were not obtained.


Fig. XVII-3. Impulse response of pupil to light (L) and
electrical  intracranial  (IC)  stimulation. 


Frank H. Baker8 has shown that in anesthetized cats direct intracranial stimulation
by electrodes stereotactically placed near the motor fibers of the oculomotor nerve produces
pupillary contraction with approximately the same transport delay and third-order
response as does light stimulation of the retina, as shown in Fig. XVII-3. Estimates of
nerve conduction times and synaptic delays for the pupillary system range from 10 msec
to 20 msec, in agreement with the assigning of 90 per cent of the transport delay and all
of the high-frequency attenuation to the output neuromuscular elements.

These experiments indicate the range of approaches utilizing system theory, and
neurophysiological and neuropharmacological dissection techniques, both of which are
necessary to make a quantitative and into-the-black-box analysis of a neurological control
system.

L.  Stark 
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B.  PUPIL  VARIATION  AND  DISJUNCTIVE  EYE  MOVEMENTS  AS  A  RESULT 

OF  PHOTIC  AND  ACCOMMODATIVE  STIMULATION 

There are several methods of stimulating the iris muscles which result in
a variation of the pupil diameter (Fig. XVII-4). First, the eye can be stimulated
with light. A signal goes from the retina to the central nervous system (CNS),
and then back to the iris muscles.1 This photic response is involuntary, and thus
prediction does not occur. Second, a target can be moved on the optical axis
of one eye, while a cooperative subject tries to keep it in focus. It is evident
that this method of providing a stimulus necessitates a voluntary contribution by
the subject and thus prediction may occur. In this situation, an error signal goes
to the CNS, indicating how far the image is out of focus. This is an even-error
signal, but in our present experimental arrangement, since there are additional
clues such as the size and brightness of the target, we actually work with an
odd-error signal.2 From the CNS there is a signal path to the ciliary muscle
and the dioptric strength of the lens varies in such a way as to obtain clear
vision. Also, an additional signal goes from the CNS to iris muscles, and pupil
diameter varies to control depth of focus of the eye.

Third,  the  accommodation  input  by  way  of  the  CNS  results  in  a  disjunctive 
eye  movement  of  the  other  eye,  although  this  eye  cannot  see  the  target.  Figure  XVII-4 
is  a  simplified  block  diagram  of  these  various  inputs,  outputs,  and  interactions. 
The  experimental  arrangement  is  shown  in  Fig.  XVII-5;  it  is  possible  to  stimulate 
one  eye  with  either  a  light  or  an  accommodation  input  and  to  measure  the  pupil 
size.  The  associated  disjunctive  movements  of  the  other  eye  which  resulted  from  the 
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Fig.  XVII-4.  Block  diagram  of  the  interacting 
iris -lens-convergence  system. 


Fig. XVII-5. Experimental arrangement.
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accommodation  input  were  measured  separately  with  equipment  previously  described.3 

To obtain an idea of the dynamics of the various systems, the eye was stimulated
with a sinusoidal light input or with a sinusoidal target-position input in the optical
axis of one eye. The variations in pupil diameter of the same eye, or the disjunctive
movements of the other eye, were measured. Bode plots are shown in Fig. XVII-6.

A.  Troelstra,  B.  L.  Zuber,  J.  I.  Simpson,  L.  Stark 
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C.  REMOTE  PATIENT-TESTING  INSTALLATION 

We  have  set  up  a  communication  link  between  the  Howe  Laboratories  of  the 
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Fig.  XVII-7.  Block  diagram  of  remote  patient-testing  installation  using  on-line  computer. 
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Fig.  XVU-8.  Bode  plot  of  eye-tracking  movement  control  system. 
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Table  XVII-1.  Gain  and  phase  as  a  function  of  frequency  typed  out  by  on-line  computer. 
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Massachusetts  Eye  and  Ear  Infirmary1  and  an  on-line  computer  (GE  225)  in  the 
Electronic  Systems  Laboratory,  M.  I.  T.,  for  studying  neurological  eye-movement 
defects.  The  communication  links  are  four  pairs  of  standard  low-fidelity  "direct" 
telephone  lines,  traveling  approximately  3  miles  in  roundabout  fashion  from  Cambridge 
to  Boston. 

The experimental arrangement, as shown in Fig. XVII-7, consists of a pseudorandom
excitation signal that is generated by the computer, converted by the
computer DIOB (data input-output buffer) to an analog voltage, and transmitted
over one pair of lines (0-2000 cps bandwidth) to drive a horizontally moving spot
on an oscilloscope face. The patient is instructed to follow the spot; the angular
direction of his eyes is measured by a pair of photocell goggles,2 and this response signal
is sent back over another pair of lines, and is digitized by the DIOB. The GE 225
computer then analyzes the gain and phase lag at each frequency of interest and
types out the information. A typical typeout is shown in Table XVII-1. Figure XVII-8
is a plot of five successive runs on A. Troelstra.
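
The report does not say how the GE 225 program extracted gain and phase lag. One common approach, shown here only as a sketch, is to correlate the stimulus and the response with a sine and a cosine at the frequency of interest and take the ratio of the two complex components. The function name, the argument names, and the sampling rate are assumptions introduced for illustration.

```python
import math


def gain_and_phase(stimulus, response, freq_hz, sample_rate_hz):
    """Return (gain, phase lag in degrees) of response relative to stimulus at freq_hz."""

    def component(signal):
        # Single-frequency Fourier sum: correlate with cosine and (negative) sine.
        real = imag = 0.0
        for n, x in enumerate(signal):
            angle = 2.0 * math.pi * freq_hz * n / sample_rate_hz
            real += x * math.cos(angle)
            imag -= x * math.sin(angle)
        return complex(real, imag)

    ratio = component(response) / component(stimulus)
    # A negative phase angle of the ratio means the response lags the stimulus.
    phase_lag_deg = -math.degrees(math.atan2(ratio.imag, ratio.real))
    return abs(ratio), phase_lag_deg
```

The estimate is cleanest when the record spans a whole number of cycles of the frequency being analyzed; otherwise leakage from other components of a pseudorandom signal biases the result.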

The peak in the frequency response has been predicted by a sampled-data
model of the eye-movement control system.3 An X-Y plotter will be installed
in the hospital laboratory and the plotted frequency response will be available
a few minutes after the experiment.

L.  Stark,  P.  A.  Willis,  Gabriella  W.  Smith 
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D.  EXPERIMENTS  ON  DISCRETE  CONTROL  OF  HAND  MOVEMENT 

Previous reports assembled evidence regarding sampled-data control phenomena
in hand motor coordination1 and also presented a rather complete sampled-data model
for eye-movement tracking.2 Recently, a thesis3 has been completed which further
explores experimental phenomena that are interpretable in terms of a discrete
model.

Unpredictable  ramps  are  used  as  an  input  signal,  and  the  rotational  load  is  made 
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low  enough  so  that  the  mechanical  output  elements  (muscles,  load,  apparatus)  do  not 
smooth the output signal too drastically. The response of the hand is then a series of
steps, as shown in Fig. XVII-9; the hand control system is evidently a discrete position
control system. When step amplitudes and times between steps are plotted as a
function of ramp velocity as in Fig. XVII-10, it is clear, since only the amplitude and
not the sampling period is velocity-dependent, that the nature of the discontinuity is


Fig.  XVII-9.  Random  ramp  response. 



Fig. XVII-10. Dependence of (a) step amplitude, and (b) step
frequency  as  a  function  of  ramp  velocity. 


QPR  No.  69 




(XVII.  NEUROLOGY) 


not  quantization,  but  rather  like  a  sampled-data  system. 
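
The distinction drawn here can be illustrated with two idealized staircase generators following a ramp. This is a sketch under simplifying assumptions (an ideal sample-and-hold, an ideal amplitude quantizer, no muscle or load dynamics); the function names and the numerical values are hypothetical and are not taken from the thesis.

```python
def sampled_data_steps(ramp_velocity, sampling_period, duration):
    """Sample-and-hold follower: step timing is fixed, step amplitude grows with velocity."""
    steps = []
    k = 1
    while k * sampling_period <= duration:
        # At each sampling instant the output jumps by the ramp travel since the last sample.
        steps.append((k * sampling_period, ramp_velocity * sampling_period))
        k += 1
    return steps  # list of (time of step, step amplitude)


def quantizer_steps(ramp_velocity, quantum, duration):
    """Amplitude quantizer: step amplitude is fixed, step timing shrinks with velocity."""
    steps = []
    k = 1
    while k * quantum / ramp_velocity <= duration:
        # The output jumps by one quantum each time the ramp crosses a new level.
        steps.append((k * quantum / ramp_velocity, quantum))
        k += 1
    return steps


for velocity in (2.0, 4.0):
    sampled = sampled_data_steps(velocity, sampling_period=0.25, duration=1.0)
    quantized = quantizer_steps(velocity, quantum=0.5, duration=1.0)
    print(velocity, len(sampled), sampled[0][1], len(quantized), quantized[0][1])
```

Doubling the ramp velocity doubles the step amplitude of the sampled-data follower while leaving its step count unchanged, whereas the quantizer keeps its amplitude and doubles its step count. The first pattern is the one described for Fig. XVII-10.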

Experimentally, varying feedback by means of an environmental clamp is often useful
in dissecting a system,4,5 especially when one can open the loop. When the loop is
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Fig.  XVII- 11.  Pulse  response  under  open-loop  conditions. 


opened, an unpredictable pulse input should produce a step response. The responses
to the initial pulses in experiments such as those displayed in Fig. XVII-11 do indeed
approximate steps. Since the inputs are repetitive, and thus eventually predictable,
the subject adapts and finally compensates for the change in feedback. The last
response to each series of input pulses in Fig. XVII-11 is more nearly a (dynamically
limited) pulse. Thus, the position control loop and the adaptive nature of the hand
system are both demonstrated in these open-loop experiments.

F.  Naves,  L.  Stark 
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A.  MOCK  METRIC  IS  FUN  AND  USEFUL  TOO 

"Four"  is  a  magic  number  for  students,  engineers,  and  scientists  in  every  field  in 
which  problems  involve  comparison  or  conversion  of  linear  measures  expressed  in 
English  and  metric  systems. 

There is no obvious compatibility between meters equal to 39.37008 inches and
inches equal to 0.0254 meter; one familiar with either may find frustration in the other.
I  have  extrapolated  that  if  my  English  ancestors  who  made  the  inch  equal  to  the  width 
of  a  man's  thumb  had  been  foresighted  enough  to  also  legislate  a  man's  thumb  equal  to 
the  fortieth  part  of  a  meter,  the  whole  course  of  history  could  have  been  happier! 

Nevertheless, we can enjoy a great deal of convenience by imagining the happy ratio.
We can say with some degree of truth that a meter is forty inches, and we can imagine
the forty-inch meter divided and subdivided into the usual metric units. This is to
imagine a linear metric system whose units are a bit longer than standard units; a bit
so small that for many practical purposes it may be ignored, yet known exactly so that it
may be accounted for for other purposes. This is the happy-ratio, first-approximation,
pseudometric system that I call "mock metric."

In  mock  metric,  "4"  is  a  magic  number: 

40 inches = 1 meter              1 inch = 1/40 meter

4 inches = 1 decimeter           1 inch = 1/4 decimeter

0.4 inch = 1 centimeter          1 inch = 10/4 = 2.5 centimeters

0.04 inch = 1 millimeter         1 inch = 100/4 = 25 millimeters

0.004 inch = 100 microns         1 inch = 100,000/4 = 25,000 microns.

Any decimal-inch expression can be divided by 4 on mere inspection; the quotient
is its mock-metric equivalent in decimeters and may be converted to any other mock-metric
unit by shifting the decimal point. For example,

7/8 in. = 0.875 in. = 0.875/4 = 0.21875 dm
                              = 2.1875 cm
                              = 21.875 mm
                              = 21,875 μ.

In  mock  metric,  "3"  is  a  magic  number  too,  but  only  for  dealing  with  measurements 
expressed  in  feet.  One  need  not  multiply  feet  by  12  to  obtain  inches  to  be  divided  by  4; 
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multiplying  feet  by  3  gives  the  same  result  expressed  in  mock  decimeters. 

For  example, 

10 ft = 10 × 3 = 30 dm
               = 3 m
or

5 ft 9 in. = 5.75 ft = 5.75 × 3 = 17.25 dm
                                = 172.5 cm

and

3 ft 4 1/2 in. = 3 ft 4.5 in. = 3 × 3 + 4.5/4 = 9 + 1.125
                                              = 10.125 dm.
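
The two magic numbers can be collected into one small routine. This is a minimal sketch of the first approximation only; the function names are my own and are introduced purely for illustration.

```python
def mock_decimeters(feet=0.0, inches=0.0):
    """Mock-metric decimeters: 3 per foot plus 1/4 per inch (a forty-inch meter)."""
    return 3.0 * feet + inches / 4.0


def mock_units(decimeters):
    """Shift the decimal point to express a mock-decimeter figure in other mock units."""
    return {
        "m": decimeters / 10.0,
        "dm": decimeters,
        "cm": decimeters * 10.0,
        "mm": decimeters * 100.0,
    }


# The worked examples from the text.
seven_eighths_inch = mock_decimeters(inches=0.875)
ten_feet = mock_decimeters(feet=10)
five_ft_nine = mock_units(mock_decimeters(feet=5, inches=9))
three_ft_four_and_half = mock_decimeters(feet=3, inches=4.5)
```

Feet and inches are passed separately so that the routine follows the mental recipe: multiply the feet by 3, divide the inches by 4, and add.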

These mock-metric expressions so easily obtained are known to be not exact standard
metric statements. A feeling for the degree of inexactness is needed for judgment
as to whether or not it may be ignored in a given situation. And exact knowledge of the
order of the inexactness is needed wherever the inexactness is to be accounted for. Both
derive most logically, I think, from considerations of the 1000 standard millimeters that
make a standard meter and the 1000 mock millimeters that make a mock meter. Since

40 inches = 40 × 25.4 = 1016 standard millimeters
and

40 inches = 40 × 25 = 1000 mock millimeters,
it follows that

1000 of any mock unit = 1016 of the similar standard unit.

The order of the inexactness of this first approximation that I call mock metric is
16 parts in 1000 or 1.6 per cent. Obviously, if 1.6 per cent of itself is added to any
mock-metric figure, the sum will be the exact standard metric equivalent.

But an easier trick may be to make a second approximation by adding to a mock-metric
figure one-sixth of 10 per cent (1/6 × 10 per cent) of itself.

For  example, 

1 inch = 1/4 = 0.25 (mock metric or first approximation)

1 inch = 0.2500 + 0.0250/6 = 0.25416 (second approximation).

But we know that

1 inch = 0.25400 standard metric exactly.

Thus  it  is  seen  that  the  inexactness  remaining  in  the  second  approximation  is 
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reduced  to  16  parts  in  25, 000,  which  is  less  than  0.  1  per  cent  and,  I  think,  of  approxi¬ 
mately  the  same  order  as  the  inexactness  of  our  knowledge  of  the  earth's  circumference! 

For practical purposes then, in circumstances for which a 1.6 per cent error is not
significant or meaningful, the first approximation serves. In more stringent circumstances
in which 1.6 per cent may be significant but 0.1 per cent not meaningful, the
second approximation serves.

Unusual circumstances that require accuracy better than 1 part in 1000 justify
the exact computation instead of the second approximation. And there is no easier way
to make the exact computation than to multiply the mock-metric figure by 1.016.

Exact  conversion  formulas  are: 

inches = (inches/4) × 1.016 standard metric

feet = (3 × feet) × 1.016 standard metric
feet and inches = (3 × feet + inches/4) × 1.016 standard metric.
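
The three levels of accuracy can be compared side by side. The sketch below assumes the input is already a mock-metric figure (for example, inches divided by 4); the function names are hypothetical.

```python
def second_approximation(mock_value):
    """Add one-sixth of 10 per cent of the mock-metric figure to itself."""
    return mock_value + mock_value / 60.0


def exact_standard(mock_value):
    """Exact standard metric equivalent: the mock-metric figure times 1.016."""
    return mock_value * 1.016


def relative_error(approximation, exact):
    """Error of an approximation as a fraction of the exact value."""
    return abs(approximation - exact) / exact


one_inch_mock = 1.0 / 4.0                      # first approximation, in decimeters
one_inch_second = second_approximation(one_inch_mock)
one_inch_exact = exact_standard(one_inch_mock)

first_error = relative_error(one_inch_mock, one_inch_exact)
second_error = relative_error(one_inch_second, one_inch_exact)
```

Because both corrections are proportional to the figure itself, the relative errors come out the same for any length, so the one-inch case is representative.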

The whole philosophy of mock metric is reversible, of course. Any standard metric
dimension multiplied by 4 produces a figure that is a first approximation of equivalent
inches, or, if divided by 3, a first approximation of equivalent feet, either of which is
exactly 101.6 per cent of the exact equivalent. We may divide the first approximation
by 1.016 to obtain the exact equivalent, or reduce the first approximation by 1/6 × 10 per
cent of itself to obtain the second approximation, with certain knowledge that the
remaining error will not be as much as 0.1 per cent.
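
The reverse direction can be sketched the same way. The code assumes that the standard metric dimension is given in decimeters, as in the forward examples; the function names are again hypothetical.

```python
def inches_first_approximation(standard_decimeters):
    """Standard decimeters times 4: a first approximation of inches."""
    return standard_decimeters * 4.0


def feet_first_approximation(standard_decimeters):
    """Standard decimeters divided by 3: a first approximation of feet."""
    return standard_decimeters / 3.0


def reverse_second_approximation(first_approximation):
    """Reduce the first approximation by one-sixth of 10 per cent of itself."""
    return first_approximation - first_approximation / 60.0


def reverse_exact(first_approximation):
    """Exact English equivalent: the first approximation divided by 1.016."""
    return first_approximation / 1.016


# 2.54 standard decimeters is exactly 10 inches; 3.048 standard decimeters is exactly 1 foot.
ten_inches_first = inches_first_approximation(2.54)
ten_inches_second = reverse_second_approximation(ten_inches_first)
ten_inches_exact = reverse_exact(ten_inches_first)
one_foot_exact = reverse_exact(feet_first_approximation(3.048))
```

Here the first approximation overshoots, so the correction is subtracted rather than added.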

The  decimal  point  is  the  same  problem  in  mock  metric  as  in  standard  metric.  In 
any  case,  it  requires  attention  to  ensure  cranking  out  answers  having  an  error  factor 
of  less  than  10! 

Mock metric defies Parkinson's Law. It is for the man in a hurry, for mental calculation,
for solution by inspection, for the fun of empathy with the work you do!

Would you like to see an actual mock-metric scale? Just take any decimal-inch
scale and you have it in your hands. Every 0.4-in. interval is a mock centimeter,
every 0.040-in. interval a mock millimeter. Only the legend has been arranged to
read in terms of inches!

E.  C.  Ingraham 
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